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Assistant Commissioner for Patents 
Washington, D.C. 20231 

Sir: 

I hereby declare that I am an official empowered to act on behalf of the nonprofit organization 
identified below: 

Name of organizatio n University of Connecticut Health Center 

Address of organization 263 Farmington Avenue 



Farmington. Connecticut 06030 



Type of organization 
EI University or other institution of higher education 

□ Tax exempt under Internal Revenue Service Code (26 USC 501(a) and 501(c)(3)) 
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Receptor as a Heat Shock Protein Receptor And Uses Thereof", by inventor Pramod K. 
Srivastava, described in 



KI the specification filed herewith 
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If the rights held by the nonprofit organization are not exclusive, each individual, concern or 
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resulting in loss of entitlement to small entity status prior to paying, or at the time of paying, the 
earliest of the issue fee or any maintenance fee due after the date on which status as a small entity is 
no longer appropriate. [37 CFR 1.28 (b)] 

I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these statements 
were made with the knowledge that willful false statements and the like so made are punishable by 
fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code, and that 
such willful false statements may jeopardize the validity of the application, and patent issuing 
thereon, or any patent to which this verified statement is directed. 
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organization having rights to the invention averring to their status as small entities. 
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5 

This application is a continuation-in-part of co-pending application number 
09/625,137, filed July 25, 2000, claiming priority under 35 U.S.C. § 1 19(e) to provisional 
application no. 60/209,095, filed June 2, 2000, both of which are incorporated by reference 
herein in their entirety. The invention was made with government support under grant 
jq number CA64394 awarded by the National Institutes of Health. The government has 
certain rights in the invention. 

1. INTRODUCTION 

The present invention relates to the use of alpha (2) macroglobulin ("a2M") receptor 
15 as a heat shock protein receptor, cells that express the a2M receptor bound to an HSP, and 
antibodies and other molecules that bind the a2M receptor-HSP complex. The invention 
also relates to screening assays to identify compounds that modulate the interaction of an 
HSP with the a2M receptor, and methods for using compositions comprising a2M-receptor 
sequences for the diagnosis and treatment of immune disorders, proliferative disorders, and 
20 infectious diseases. 

2. BACKGROUND OF THE INVENTION 

2.1. HEAT SHOCK PROTEINS 

Heat shock proteins (HSPs), also referred to as stress proteins, were first identified 

25 

as proteins synthesized by cells in response to heat shock. Hsps have classified into five 
families, based on molecular weight, HsplOO, Hsp90, Hsp70, Hsp60, and smHsp. Many 
members of these families were found subsequently to be induced in response to other 
stressful stimuli including nutrient deprivation, metabolic disruption, oxygen radicals, and 
infection with intracellular pathogens (see Welch, May 1993, Scientific American 56-64; 
30 Young, 1990, Annu. Rev. Immunol. 8:401-420; Craig, 1993, Science 260:1902-1903; 
Gething et al, 1992, Nature 355:33-45; and Lindquist et al, 1988, Annu. Rev. Genetics 
22:631-677). 

Heat shock proteins are among the most highly conserved proteins in existence. For 
example, DnaK, the Hsp70 from E. coli has about 50% amino acid sequence identity with 

35 

Hsp70 proteins from excoriates (Bardwell et al, 1984, Proc. Natl. Acad. Sci. 81:848-852). 
The Hsp60 and Hsp90 families also show similarly high levels of intra- family conservation 
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(Hickey et al, 1989, Mol. Cell. Biol. 9:2615-2626; Jindal, 1989, Mol. Cell. Biol. 9:2279- 
2283). In addition, it has been discovered that the Hsp60, Hsp70 and Hsp90 families are 
composed of proteins that are related to the stress proteins in sequence, for example, having 
greater than 35% amino acid identity, but whose expression levels are not altered by stress. 

5 Studies on the cellular response to heat shock and other physiological stresses 

revealed that the HSPs are involved not only in cellular protection against these adverse 
conditions, but also in essential biochemical and immunological processes in unstressed 
cells. HSPs accomplish different kinds of chaperoning functions. For example, members 
of the Hsp70 family, located in the cell cytoplasm, nucleus, mitochondria, or endoplasmic 

10 reticulum (Lindquist et al, 1988, Ann. Rev. Genetics 22:631-677), are involved in the 
presentation of antigens to the cells of the immune system, and are also involved in the 
transfer, folding and assembly of proteins in normal cells. HSPs are capable of binding 
proteins or peptides, and releasing the bound proteins or peptides in the presence of 
adenosine triphosphate (ATP) or low pH. 

15 

2.2. IMMUNOGENICITY OF HSP-PEPTIDE COMPLEXES 

Srivastava et al demonstrated immune response to methylcholanthrene-induced 
sarcomas of inbred mice (1988, Immunol. Today 9:78-83). In these studies, it was found 
that the molecules responsible for the individually distinct immunogenicity of these tumors 

20 were glycoproteins of 96kDa (gp96) and intracellular proteins of 84 to 86kDa (Srivastava et 
al, 1986, Proc. Natl. Acad. Sci. USA 83:3407-341 1; Ullrich et al, 1986, Proc. Natl. Acad. 
Sci. USA 83:3121-3125). Immunization of mice with gp96 or p84/86 isolated from a 
particular tumor rendered the mice immune to that particular tumor, but not to antigenically 
distinct tumors. Isolation and characterization of genes encoding gp96 and p84/86 revealed 

25 significant homology between them, and showed that gp96 and p84/86 were, respectively, 
the endoplasmic reticular and cytosolic counterparts of the same heat shock proteins 
(Srivastava et al, 1988, Immunogenetics 28:205-207; Srivastava et al, 1991, Curr. Top. 
Microbiol. Immunol. 167:109-123). Further, Hsp70 was shown to elicit immunity to the 
tumor from which it was isolated but not to antigenically distinct tumors. However, Hsp70 

30 depleted of peptides was found to lose its immunogenic activity (Udono and Srivastava, 
1993, J. Exp. Med. 178:1391-1396). These observations suggested that the heat shock 
proteins are not immunogenic per se, but form noncovalent complexes with antigenic 
peptides, and the complexes can elicit specific immunity to the antigenic peptides 
(Srivastava, 1993, Adv. Cancer Res. 62:153-177; Udono et al, 1994, J. Immunol., 

35 152:5398-5403; Suto et al, 1995, Science, 269:1585-1588). 

Noncovalent complexes of HSPs and peptide, purified from cancer cells, can be used 
for the treatment and prevention of cancer and have been described in PCT publications WO 
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96/1041 1, dated April 11, 1996, and WO 97/10001, dated March 20, 1997 (U.S. Patent No. 
5,750,119 issued April 12, 1998, and U.S. Patent No. 5,837,251 issued November 17, 1998, 
respectively, each of which is incorporated by reference herein in its entirety). The isolation 
and purification of stress protein-peptide complexes has been described, for example, from 

5 pathogen-infected cells, and can be used for the treatment and prevention of infection caused 
by the pathogen, such as viruses, and other intracellular pathogens, including bacteria, 
protozoa, fungi and parasites (see, for example, PCT Publication WO 95/24923, dated 
September 21, 1995). Immunogenic stress protein-peptide complexes can also be prepared 
by in vitro complexing of stress protein and antigenic peptides, and the uses of such 

lO complexes for the treatment and prevention of cancer and infectious diseases has been 
described in PCT publication WO 97/10000, dated March 20, 1997 (U.S. Patent No. 
6,030,618 issued February 29, 2000. The use of stress protein-peptide complexes for 
sensitizing antigen presenting cells in vitro for use in adoptive immunotherapy is described 
in PCT publication WO 97/10002, dated March 20, 1997 (see also U.S. Patent No. 5,985,270 

1 5 issued November 1 6, 1 999). 

2.3. ALPHA (2) MACRO GLOBULIN RECEPTOR 

The alpha (2) macroglobulin receptor (herein referred to interchangeably as either 
"a2MR" or "the oc2M receptor"), also known as LDL (low-density lipoprotein) receptor- 

20 Related Protein ("LRP") or CD91, is primarily expressed in liver, brain and placenta. The 
a2M receptor is a member of the low density lipoprotein receptor family. The extracellular 
domain of the human receptor comprises six 50-amino acid EGF repeats and 31 complement 
repeats of approximately 40-42 amino acids. The complement repeats are organized, from 
the amino to the carboxy-terminus, into clusters of 2, 8, 10 and 1 1 repeats, called Cluster I, 

25 II, in and IV (Herz et al y 1988, EMBO J. 7:41 19-4127). One study points to Cluster II (Cl- 
II), which contains complement repeats 3-10 (CR3-10), as the major ligand binding portion 
of the receptor (Horn et al, 1997, J. Biol. Chem. 272:13608-13613). The cc2M receptor 
plays a role in endocytosis of a diversity of ligands. In addition to a2M, other ligands of 
cc2MR include lipoprotein complexes, lactoferrin, tissue-type plasminogen activator (tPA), 

30 urokinase-type plasminogen activator (uPA), and exotoxins. Thus, the a2M receptor plays 
roles in a variety of cellular processes, including endocytosis, antigen presentation, 
cholesterol regulation, ApoE-containing lipoprotein clearance, and chylomicron remnant 
removal. 

Human a2M is synthesized as a 1474 amino acid precursor, the first 23 of which 
35 function as a signal sequence that is cleaved to yield a 145 1 amino acid mature protein (Kan 
et al, 1985, Proc. Natl. Acad. Sci. U.S.A. 82:2282-2286). In experiments with recombinant 
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protein, the carboxy-terminal 138 amino acids of a2M (representing amino acids 1314-1451 
of the mature protein) was found to bind the receptor. This domain has been called the RBD 
(receptor-binding domain; Salvesent et al, 1992, FEBS Lett. 313:198-202; Holtet etal, 
1994, FEBS Lett. 344:242-246). An RBD variant (RBDv), a proteolytic fragment of cc2M 

5 comprising an additional 15 amino terminal residues (representing amino acids 1314-1451 of 
the mature protein) binds to the receptor with almost the same affinity as a2M-proteinase 
(Holtet et al, 1994, FEBS Lett. 344:242-246). 

Alignment of cc2MR ligands identifies a conserved domain present in the RBDs of a 
macroglobulins. The conserved sequence spans amino acids 1366-1392 of human a2M. 

10 Conserved residues within this domain are Phe 1366 , Leu 1369 , Lys 1370 , Val 1373 , Lys ]374 , Glu 1377 , 
Val 1382 , Arg 13g4 (Nielsen et al, 1996, J. Biol. Chem. 271:12909-12912). Of these, Lys 1370 and 
Lys 1374 were shown to be critical for receptor binding (Nielsen et al, 1996, J. Biol. Chem. 
271:12909-12912). 

Binding of ligands, including the binding to a2M, to a2MR is inhibited by a2MR- 
15 associated protein (RAP). RAP is a 39 kDa folding chaperone that resides in the 

endoplasmic reticulum and is required for the normal processing of a2MR. RAP has the 
ability to competitively inhibit the binding of all a2MR to all a2MR ligands tested. One 
study shows RAP to bind to complement repeats C5-C7 in cluster II (Cl-II) of a2MR (Horn 
et al, 1997, J. Biol. Chem. 272:13608-13613); another shows RAP to bind to all two 
20 complement repeat-modules in Cl-II except the C9-C10 module (Andersen et al, J. Biol. 
Chem., Mar. 24, 2000, PMID: 10747921; published electronically ahead of print). Three 
structural domains, 1, 2 and 3, have been identified in RAP, consisting of amino acid 
residues 18-1 12, 1 13-218 and 219-323, respectively. Ligand competition titration of 
recombinant RAP domains indicates that determinants for the inhibition of test ligands 
25 reside in the C-terminal regions of domains 1 and 3 (Ellgaard et al, 1997, Eur. J. Biochem. 
244:544-51). 

2.4. ANTIGEN PRESENTATION 

Major histocompatibility complex (MHC) molecules present antigens on the cell 

30 surface of antigen-presenting cells. Cytotoxic T lymphocytes (CTLs) then recognize MHC 
molecules and their associated peptides and kill the target cell. Antigens are processed by 
two distinct antigen processing routes depending upon whether their origin is intracellular or 
extracellular. Intracellular or endogenous protein antigens, i.e., antigens synthesized within 
the antigen-presenting cell, are presented by MHC class I (MHC I) molecules to CD8+ 

3 5 cytotoxic T lymphocytes. On the other hand, extracellular or exogenously synthesized 
antigenic determinants are presented on the cell surface of "specialized" or "professional" 
APCs (macrophages, for example) by MHC class II molecules to CD4+ T cells (see, 
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generally, Fundamental Immunology, W.E. Paul (ed.), New York: Raven Press, 1984). This 
compartmental segregation of antigen processing routes is important to prevent tissue 
destruction that could otherwise occur during an immune response as a result of shedding of 
neighboring cell MHC I antigens. 

5 The heat shock protein gp96 chaperones a wide array of peptides, depending upon the 

source from which gp96 is isolated (for review, see Srivastava et al, 1998, Immunity 8: 657- 
665). Tumor-derived gp96 carries tumor-antigenic peptides (Ishii et al, 1999, J. 
Immunology 162:1303-1309); gp96 preparations from virus-infected cells carry viral 
epitopes (Suto and Srivastava, 1995, Science 269:1585-1588; Nieland et al, 1998, Proc. 

10 Natl. Acad. Sci. USA 95:1800-1805), and gp96 preparations from cells transfected with 
model antigens such as ovalbumin or P-galactosidase are associated with the corresponding 
epitopes (Arnold et al, 1995, J. Exp. Med. 182:885-889; Breloer et al, 1998, Eur. J. 
Immunol. 28:1016-1021). The association of gp96 with peptides occurs in vivo (Menoret 
and Srivastava, 1999, Biochem. Biophys. Research Commun. 262:813-818). Gp96-peptide 

15 complexes, whether isolated from cells (Tamura et al, 1997, Science 278:117-120), or 
reconstituted in vitro (Blachere et al, 1997, J. Exp. Med. 186:1 183-1406) are excellent 
immunogens and have been used extensively to elicit CD8+ T cell responses specific for the 
gp96-chaperoned antigenic peptides. 

The capacity of gp96-peptide complexes to elicit an immune response is dependent 

20 upon the transfer of the peptide to MHC class I molecules of antigen-presenting cells (Suto 
and Srivastava, 1995, supra). Endogenously synthesized antigens chaperoned by gp96 in 
the endoplasmic reticulum [ER] can prime antigen-specific CD8+ T cells (or MHC I- 
restricted CTLs) in vivo; this priming of CD8+ T cells requires macrophages. However, the 
process whereby exogenously introduced gp96-peptide complexes elicit the antigen- specific 

25 CD8+ T cell response is not completely understood since there is no established pathway for 
the translocation of extracellular antigens into the class I presentation machinery. Yet 
antigenic peptides of extracellular origin associated with HSPs are somehow salvaged by 
macrophages, channeled into the endogenous pathway, and presented by MHC I molecules 
to be recognized by CD8+ lymphocytes (Suto and Srivastava, 1995, supra; Blachere et al, 

30 1997, J. Exp. Med. 186:1315-22). 

Several models have been proposed to explain the delivery of extracellular peptides 
for antigen presentation. One proposal, known as the "direct transfer" model, suggests that 
HSP-chaperoned peptides are transferred to MHC I molecules on the cell surface of 
macrophages for presentation to CD8+ T lymphocytes. Another suggestion is that soluble 

35 extracellular proteins can be trafficked to the cytosol via constitutive macropinocytosis in 
bone marrow-derived macrophages and dendritic cells (Norbury et al, 1997, Eur. J. 
Immunol. 27:280-288). Yet another proposed mechanism is that HSPs are taken up by the 
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MHC class I molecules of the macrophage, which stimulate the appropriate T cells 
(Srivastava et al, 1994, Immunogenics 39:93-98. Others have suggested that a novel 
intracellular trafficking pathway may be involved for the transport of peptides from the 
extracellular medium into the lumen of ER (Day et al, 1997, Proc. Natl. Acad. Sci. 94:8064- 

5 8069; Nicchitta, 1998, Curr. Opin. in Immunol. 10:103-109). Further suggestions include 
the involvement of phagocytes which (a) possess an ill-defined pathway to shunt protein 
from the phagosome into the cytosol where it would enter the normal class I pathway; (b) 
digest ingested material in lysosomes and regurgitate peptides for loading on the surface to 
class I molecules (Bevan, 1995, J. Exp. Med. 182:639-41). 

10 Still others have proposed a receptor-mediated pathway for the delivery of 

extracellular peptides to the cell surface of APCs for antigen presentation. In view of the 
extremely small quantity of gp96-chaperoned antigenic peptides required for immunization 
(Blachere et al, 1997, supra), and the strict dependence of immunogenicity of gp96-peptide 
complexes on functional antigen presenting cells (APCs) (Udono et al, 1994, Proc. Natl. 

15 Acad. Sci. U.S.A. 91 :3077-3081), APCs had been proposed to possess receptors for gp96 
(Srivastava et al, 1994, Immunogenetics 39:93-98). Preliminary microscopic evidence 
consistent with such receptors has been recently obtained (Binder et al, 1998, Cell Stress & 
Chaperones 3 (Supp.l):2.; Arnold-Schild et al, 1999, J. Immunol. 162: 3757-3760; and 
Wassenberg et al , 1999, J. Cell Sci. 1:12). One hypothesis is that the mannose receptor is 

20 used in the uptake of gp96, but no mechanism has been proposed for the non-glycosylated 
HSPs, such as Hsp70 (Ciupitu et al, 1998, J. Exp. Med., 187:685-691). 

The identification and characterization of specific molecules involved in HSP- 
mediated antigen presentation of peptides could provide useful reagents and techniques for 
eliciting specific immunity by HSP and HSP-peptide complexes, and for developing novel 

25 diagnostic and therapeutic methods. 

Citation or discussion of a reference herein shall not be construed as an admission 
that such is prior art to the present invention. 



3. SUMMARY OF THE INVENTION 

The present invention relates to compositions and methods for the use of the alpha (2) 
macroglobulin ("a2M") receptor as a heat shock protein receptor. The invention is based, in 
part, on the Applicant's discovery that the oc2M receptor is a cell surface receptor for heat 
shock proteins. In particular, the Applicant has shown that the heat shock protein gp96 binds 
directly to the a2M receptor, and that a2M inhibits re-presentation of gp96-chaperoned 
antigenic peptides by macrophages. Because no precedent exists for receptors that recognize 
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abundant and intracellular proteins like HSPs, the discovery of an HSP cell surface receptor 
was highly unexpected. 

The present invention provides compositions comprising complexes of HSPs and the 
a2M receptor, and antibodies and other molecules that bind the HSP-cc2M receptor complex. 

5 The invention also encompasses methods for the use of the cc2M receptor as a heat shock 
protein receptor, including methods for screening for compounds that modulate the 
interaction of HSP and the a2M receptor, and methods for treatment and detection of HSP- 
a2M receptor-mediated processes and HSP-a2M receptor-related disorders and conditions, 
such as autoimmune disorders, proliferative disorders and infectious diseases. 

1 o Tne invention provides a method for identifying a compound that modulates an HSP- 

a2M receptor-mediated process, comprising: (a) contacting a test compound with a heat 
shock protein and an alpha (2) macroglobulin receptor; and (b) measuring the level of alpha 
(2) macroglobulin receptor activity or expression, such that if the level of activity or 
expression measured in (b) differs from the level of alpha (2) macroglobulin receptor activity 

1 5 in the absence of the test compound, then a compound that modulates an HSP-a2M 

receptor-mediated process is identified. In one embodiment of this method the compound 
identified is an antagonist which interferes with the interaction of the heat shock protein with 
the alpha (2) macroglobulin receptor, further comprising the step of: (c) determining 
whether the level interferes with the interaction of the heat shock protein and the alpha(2) 

20 macroglobulin receptor. In another embodiment, the test compound is an antibody specific 
for the alpha (2) macroglobulin receptor. In another embodiment, the test compound is an 
antibody specific for alpha (2) macroglobulin. In another embodiment, test compound is an 
antibody specific for a heat shock protein. In another embodiment, the test compound is a 
small molecule. In another yet embodiment, the test compound is a peptide. In another 

25 embodiment, the peptide comprises at least 5 consecutive amino acids of the alpha (2) 
macroglobulin receptor. In yet another embodiment, the peptide comprises at least 5 
consecutive amino acids of alpha (2) macroglobulin. In yet another embodiment, the peptide 
comprises at least 5 consecutive amino acids of a heat shock protein sequence. In another 
embodiment, the compound is an agonist which enhances the interaction of the heat shock 

30 protein with the alpha (2) macroglobulin receptor. In another embodiment, which the HSP- 
a2M receptor-mediated process affects an autoimmune disorder, a disease or disorder 
involving disruption of antigen presentation or endocytosis, a disease or disorder involving 
cytokine clearance or inflammation, a proliferative disorder, a viral disorder or other 
infectious disease, hypercholesterolemia, Alzheimer's disease, diabetes, or osteoporosis. 

35 The invention also provides a method for identifying a compound that modulates an 

HSP-cc2M receptor-mediated process, comprising: (a) contacting a test compound with a 



-7- 



NY2- 1119469.2 



heat shock protein and an alpha (2) macroglobulin receptor-expressing cell; and (b) 
measuring the level of alpha (2) macroglobulin receptor activity or expression in the cell, 
such that if the level of activity or expression measured in (b) differs from the level of alpha 
(2) macroglobulin receptor activity in the absence of the test compound, then a compound 

5 that modulates an HSP-a2M receptor-mediated process is identified. In yet another 

embodiment, wherein the alpha (2) macroglobulin receptor activity measured is the ability to 
interact with a heat shock protein. 

The invention also encompasses a method for identifying a compound that modulates 
the binding of a heat shock protein to the a2M receptor, comprising: (a) contacting a heat 

10 shock protein with an alpha (2) macroglobulin receptor, or fragment, or analog, derivative or 
mimetic thereof, in the presence of a test compound; and (b) measuring the amount of heat 
shock protein bound to the alpha (2) macroglobulin receptor, or fragment, analog, derivative 
or mimetic thereof, such that if the amount of bound heat shock protein measured in (b) 
differs from the amount of bound heat shock protein measured in the absence of the test 

1 5 compound, then a compound that modulates the binding of an HSP to the a2M receptor is 
identified. In another embodiment, alpha (2) macroglobulin receptor contacted in step (a) is 
on a cell surface. In another embodiment, the alpha (2) macroglobulin receptor is 
immobilized to a solid surface. In another embodiment, the solid surface is a microtiter dish. 
In another embodiment, the amount of bound heat shock protein is measured by contacting 

20 the cell with a heat shock protein-specific antibody. In yet another embodiment, the heat 
shock protein is labeled and the amount of bound heat shock protein is measured by 
detecting the label. In another embodiment, the heat shock protein is labeled with a 
fluorescent label. 

The invention further provides a method for identifying a compound that modulates 
25 heat shock protein-mediated antigen presentation by alpha (2) macroglobulin receptor- 
expressing cells comprising: (a) adding a test compound to a mixture of alpha (2) 
macroglobulin receptor-expressing cells and a complex consisting essentially of a heat shock 
protein noncovalently associated with an antigenic molecule, under conditions conducive to 
alpha (2) macroglobulin receptor-mediated endocytosis; (b) measuring the level of antigen- 
30 specific stimulation of cytotoxic T cells by alpha (2) macroglobulin receptor-expressing 
cells, such that if the level measured in (b) differs from the level of said stimulation in the 
absence of the test compound, then a compound that modulates heat shock protein-mediated 
antigen presentation by alpha (2) macroglobulin receptor-expressing cells is identified. In 
one embodiment of this method, the step of measuring the level of the antigenic molecule 
35 presented on the cell surface of step (b) comprises: (i) adding the alpha (2) macroglobulin 
receptor-expressing cells formed in step (a) to T cells under conditions conducive to the 
activation of the T cells; and (ii) comparing the level of activation of said cytotoxic T cells 
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with the level of activation of T cells by an alpha (2) macroglobulin receptor-expressing cell 
formed in the absence of the test compound, wherein an increase of decrease in level of T 
cell activation indicates that a compound that modulates heat shock protein-mediated antigen 
presentation by alpha (2) macroglobulin receptor-expressing cells is identified. 
5 In various embodiments, the heat shock protein used in the methods of the invention 

is gp96. 

In another embodiment, the invention provides a method for detecting a heat shock 
protein-alpha (2) macroglobulin receptor-related disorder in a mammal comprising 
measuring the level of an HSP-alpha (2) macroglobulin receptor-mediated process in a 
IQ patient sample, such that if the measured level differs from the level found in clinically 
normal individuals, then a heat shock protein-alpha (2) macroglobulin receptor-related 
disorder is detected. 

The invention also encompasses kits comprising compositions of the invention. In 
one embodiment, a kit is provided, packaged in one or more containers, comprising: (a) a 

15 purified heat shock protein, nucleic acid encoding a heat shock protein, or cell expressing a 
heat shock protein; and (b) an alpha (2) macroglobulin receptor polypeptide, nucleic acid 
encoding an alpha (2) macroglobulin receptor polypeptide, or cell expressing an alpha (2) 
macroglobulin receptor polypeptide. In one embodiment, the kit the alpha (2) macroglobulin 
receptor polypeptide, nucleic acid encoding an alpha (2) macroglobulin receptor polypeptide, 

20 or cell expressing an alpha (2) macroglobulin receptor polypeptide is purified. In another 
embodiment, the kit further comprises instructions for use in treating an autoimmune 
disorder, an infectious disease, or a proliferative disorder. 

The invention also provides a method for modulating an immune response 
comprising administering to a mammal a purified compound that modulates the interaction 

25 of a heat shock protein with the alpha (2) macroglobulin receptor. In one embodiment, the 
compound is an agonist which enhances the interaction of the heat shock protein and the 
alpha (2) macroglobulin receptor. In another embodiment of this method the compound in 
an antagonist that interferes with the interaction between the heat shock protein and the a2M 
receptor. 

30 The invention further provides a method for treating an autoimmune disorder 

comprising administering to a mammal in need of such treatment a purified compound that 
interferes with the interaction of a heat shock protein with the alpha (2) macroglobulin 
receptor. In one embodiment of this method the compound in an antagonist that interferes 
with the interaction between the heat shock protein and the a2M receptor. In one 

35 embodiment, the antagonist is an antibody specific for alpha (2) macroglobulin receptor. In 
another embodiment, the antagonist is an antibody specific for a heat shock protein. In 
another embodiment, the antagonist is a small molecule. In another embodiment, the 
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antagonist is a peptide. In another embodiment, the peptide comprises at least 5 consecutive 
amino acids of alpha (2) macroglobulin receptor. In another embodiment, the peptide 
comprises at least 5 consecutive amino acids of alpha (2) macroglobulin. In another 
embodiment, the peptide comprises at least 5 consecutive amino acids of a heat shock protein 
5 sequence. 

The invention further provides a method for increasing the immunopotency of a 
cancer cell or an infected cell comprising transforming said cell with a nucleic acid 
comprising a nucleotide sequence that (i) is operably linked to a promoter, and (ii) encodes 
an alpha (2) macroglobulin receptor polypeptide. 

1 0 Still further, the invention provides a method for increasing the immunopotency of a 

cancer cell or an infected cell comprising: (a) transforming said cell with a nucleic acid 
comprising a nucleotide sequence that (i) is operably linked to a promoter, and (ii) encodes 
an alpha (2) macroglobulin receptor polypeptide, and (b) administering said cell to an 
individual in need of treatment, so as to obtain an elevated immune response. 

1 5 The invention also provides a recombinant cancer cell transformed with a nucleic 

acid comprising a nucleotide sequence that (i) is operably linked to a promoter, and (ii) 
encodes an alpha (2) macroglobulin receptor polypeptide. In one embodiment, the 
recombinant cell is a human cell. 

In yet another embodiment, the invention provides a recombinant infected cell 

20 transformed with a nucleic acid comprising a nucleotide sequence that (i) is operably linked 
to a promoter, and (ii) encodes an alpha (2) macroglobulin receptor polypeptide. In one 
embodiment, the recombinant cell is a human cell. 

In another embodiment, the invention provides a method for screening for molecules 
that specifically bind to an a2M receptor comprising the steps of: (a) contacting an a2M 

25 receptor with one or more test molecules under conditions conducive to binding; and 
(b) determining whether any of said test molecules specifically bind to the a2M receptor. 
In one embodiment of this method, test molecules are potential immunotherapeutic drugs. 

The invention also provides a method for identifying a compound that modulates the 
binding of an a2M receptor ligand to the <x2M receptor comprising: contacting an a2M 

30 receptor with an a2M receptor ligand, or an a2M receptor-binding fragment, analog, 
derivative, or mimetic thereof, in the presence of one or more test compound; and (b) 
measuring the amount of a2M receptor ligand, or fragment, analog, derivative or mimetic 
thereof, bound to the a2M receptor, such that if the amount of bound a2M receptor ligand 
measured in (b) differs from the amount of bound a2M measured in the absence of the test 

35 compound, then a compound that modulates the binding of an cc2M receptor ligand to the 
a2M receptor is identified. 
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In another embodiment, a method is provided for identifying a compound that 
modulates the interaction between the a2M receptor and an oc2M receptor ligand, 
comprising: (a) contacting an a2M receptor with one or more test compounds; and (b) 
measuring the level of a2M receptor activity or expression, such that if the level of activity 

5 or expression measured in (b) differs from the level of a2M receptor activity in the absence 
of one or more test compounds, then a compound that modulates the interaction between the 
a2M receptor and an a2M receptor ligand is identified. In one embodiment, the cc2M 
receptor ligand is cc2M. 

In another embodiment, a method is provided for identifying a compound that 

10 modulates antigen presentation by a2M receptor-expressing cells comprising: (a) adding one 
or more test compounds to a mixture of a2M receptor-expressing cells and a complex 
comprising an a2M receptor ligand and an antigenic molecule, under conditions conducive 
to a2M receptor-mediated endocytosis; (b) measuring the level of stimulation of antigen- 
specific cytotoxic T cells by the oc2M receptor-expressing cells, such that if the level 

15 measured in (b) differs from the level of said stimulation in the absence of the one or more 
test compounds, then a compound that modulates antigen presentation by a2M receptor- 
expressing cells is identified. 

In another embodiment, the invention provides a method for modulating an immune 
response comprising administering to a mammal a purified compound that binds to the a2M 

20 receptor in an amount effective to modulate an immune response in the mammal. 

In yet another embodiment, a method for treating or preventing a disease or disorder 
is provided comprising administering to a mammal a purified compound that binds to the 
a2M receptor in an amount effective to treat or prevent a disease or disorder in the mammal. 
In one embodiment, the disease or disorder is cancer or an infectious disease. 

25 hi a further embodiment, a method is provided for treating an autoimmune disorder 

comprising administering to a mammal in need of such treatment a purified compound that 
binds to the a2M receptor in an amount effective to treat an autoimmune disorder in the 
mammal. 

The term "HSP-a2M receptor-mediated process" as used herein refers to a process 
30 dependent and/or responsive, either directly or indirectly, to the interaction of HSP with the 
a2M receptor. Such processes include processes that result from an aberrant level of 
expression, synthesis and/or activity of a2M receptor, such as endocytic activities relating to 
the binding of the various oc2M ligands, including but not limited to HSP, a2M, lipoprotein 
complexes, lactoferrin, tissue-type plasminogen activator (tPA), urokinase-type plasminogen 
3 5 activator (uPA), and exotoxins. Such processes include, but are not limited to, endocytosis, 
antigen presentation, cholesterol regulation, apoE-containing lipoprotein clearance, and 
chylomicron remnant removal. 
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The terms "HSP-cc2M receptor-related disorder" and "HSP-a2M receptor-related 
condition", as used herein, refers to a disorder and a condition, respectively, involving a 
HSP-a2M receptor interaction. Such disorders and conditions may result, for example, from 
an aberrant ability of the a2M receptor to interact with HSP, perhaps due to aberrant levels 

5 of HSP and/or a2M receptor expression, synthesis and/or activity relative to levels found in 
normal, unaffected, unimpaired individuals, levels found in clinically normal individuals, 
and/or levels found in a population whose levels represent a baseline, average HSP and/or 
oc2M receptor levels. Such disorders include, but are not limited to, autoimmune disorders, 
diseases and disorders involving disruption of antigen presentation and/or endocytosis, 

1 0 diseases and disorders involving cytokine clearance and/or inflammation, proliferative 
disorders, viral disorders and other infectious diseases, hypercholesterolemia, Alzheimer's 
disease, diabetes, and osteoporosis. 

The term "oc2MR ligand" as used herein, refers to a molecule capable of binding to 
the a2M receptor. Such a2MR ligands include as well as known ligands, such as, but not 

25 limited to, a2M and cc2M complexes, heat shock proteins and heat shock protein complexes, 
lipoprotein complexes, lactoferrin, tissue-type plasminogen activator (tPA), urokinase-type 
plasminogen activator (uPA), and exotoxins. In addition, a2MR ligands also include 
molecules which can readily be identified as a2MR ligands using standard binding assays 
well known in the art. Such a2MR ligands are typically endocytosed by cell upon binding to 

20 the cc2M receptor. 

4. BRIEF DESCRIPTION OF THE FIGURES 

FIG. 1 A-C . Identification of an 80 kDa polypeptide as a putative gp96 receptor. A. 
Confocal microscopy of re-presentation-competent RAW264.7 cells stained with gp96-FITC 
(left panel) and with albumin-FITC (right panel). B. SDS-PAGE analysis of detergent 
extracts of plasma membranes from surface biotinylated RAW264.7 (re-presentation- 
competent) or P815 cells (representation-incompetent) eluted from gp96 or albumin- 
Sepharose (SA) columns and stained with silver stain (top) or avidin-peroxidase (bottom). 
^ C. gp96-SASD-I 125 was cross-linked to live peritoneal macrophages (MO) or P815 cells, 
and the cell ly sates examined by SDS-PAGE and autoradiography. Various components were 
omitted as controls, as indicated. 

FIG. 2A-B. Anti-p80 antiserum detects an 80 kDa molecule and inhibits re-presentation of 
gp96-chaperoned AH1 peptide by macrophage. A. Pre-immune and immune sera were used 
to probe blots of plasma membrane extracts of RAW264.7, peritoneal macrophages (both 
cell types re-presentation-competent), or P815 cells. B. Re-presentation of gp96-chaperoned 



- 12- 



NY2 - 1119469.2 



peptide AH1. Sera were added at the final dilution indicated. The solid cross indicates the 
level of T cell stimulation when the APCs were pulsed directly with the AH1 peptide. The 
open cross indicates the corresponding value with impulsed APCs. 

5 FIG. 3A-C. Protein microsequencing of the 80 kDa protein. A. Analysis of a single tryptic 
(GALHIYHQR) peptide by tandem- mass spectrometry. All possible b- and y-ion series 
together with identified b-ion series (red) and y-ion series (blue) are shown. B. Collision- 
induced dissociation (CID) spectrum of this peptide is shown. C. Four identified peptides 
from the cc2M receptor, peptide mass, and sequence are shown. 

10 

FIG. 4. a2-Macroglobulin inhibits re-presentation of gp96-chaperoned AH1 peptide by 
macrophage. The solid cross indicates the level of T cell stimulation when the APCs were 
pulsed directly with the AH1 peptide. The open cross indicates the corresponding value with 
unpulsed APCs. 

15 

FIG. 5. ct2M receptor is a sensor of necrotic cell death due to its ability to detect 
extracellular gp96. Conversely, receptors (psR) for phosphatidyl serine (ps) detect apoptotic 
cell death. 

20 FIG. 6 A. The mouse a2MR cDNA (SEQ ID NO: 1) and predicted open reading frame of 
murine oc2MR protein (Genbank accession no. CAA47817). B. The murine a2M protein 
(SEQ ID NO:2), with residues identified by microsequencing an 80 kDa, gp96-interacting 
fragment of the receptor highlighted in bold. 

25 FIG. 7A. The human a2M cDNA (SEQ ID NO:3) and predicted open reading frame of oc2M 
protein (SEQ ID NO:4)(Genbank accession no. Ml 1313). B. The sequence of the mature 
human oc2M protein (SEQ ID NO: 5), following cleavage of the N-terminal 23 amino acid 
signal sequence. Highlighted residues represent the 138 amino acid ct2MR-binding domain 
(RBD). Underlined residues represent an extension of the RBD that is present in a a2MR- 

30 binding, proteolytic fragment of a2M (RBDv). Bolded residues have been shown to be 

important for a2MR binding. Italicized residues represent a domain that is conserved among 
ligands of oc2MR. 

FIG. 8 A. The human a2MR cDNA (SEQ ID NO: 6) and predicted open reading frame of 
35 human a2MR protein (Genbank accession no. NP_002323). B. Primary amino acid sequence 
of human a2MR (SEQ ID NO:7). The approximate locations of complement repeat clusters 
I and II are highlighted in grey. Individual complement repeats of Cl-II are indicated as 
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follows: amino acids of CR3, 5, 7 and 9 are in italics, and amino acids of CR4, 6, 8, and 10 
are underlined. Amino acids highlighted in bold were present in an 80kDa peptide fragment 
of the mouse oc2MR that bound to gp96. The double underlined residues represent the 
predicted signal peptide. For the locations of other features of the receptor, such as the EGF 
5 repeats, see the article by (Herz et al, 1988, EMBO J. 7:41 19-4127). 



5. DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to compositions and methods for the use of the alpha (2) 
macroglobulin receptor (also referred to interchangeably herein as "a2MR" or "the a2M 
receptor") as a heat shock protein ("HSP") receptor. In particular, the present invention 
provides compositions comprising isolated oc2MR- ligand complexes, e.g., a2MR-HSP 
complexes, including isolated and/or recombinant cells, and antibodies, molecules and 
compounds that modulate the interaction of a2MR with an cc2MR ligand, such as HSP. The 
invention further encompasses methods for the use of a2MR as a heat shock protein receptor, 
including screening assays to identify compounds that modulate the interaction of oc2MR 
with an HSP, or other cc2MR ligand, and methods for the use of these molecules and 
complexes for the diagnosis and treatment of immune disorders, proliferative disorders, and 
infectious diseases. 

The term "<x2MR ligand" as used herein, refers to a molecule capable of binding to 
the alM receptor. Such a2MR ligands include as well as known ligands, such as, but not 
limited to, a2M and a2M complexes, heat shock proteins and heat shock protein complexes, 
lipoprotein complexes, lactoferrin, tissue-type plasminogen activator (tPA), urokinase-type 
plasminogen activator (uPA), and exotoxins. In addition, a2MR ligands also include 
molecules which can readily be identified as a2MR ligands using standard binding assays 
well known in the art. Such a2MR ligands are typically endocytosed by cell upon binding to 
a2MR. 

An HSP useful in the practice of the invention may be selected from among any 
cellular protein that satisfies any one of the following criteria: the intracellular concentration 
of an HSP increases when a cell is exposed to a stressful stimulus; an HSP can bind other 
proteins or peptides, and can release the bound proteins or peptides in the presence of 
adenosine triphosphate (ATP) or low pH; or an HSP possesses at least 35% homology with 
any cellular protein having any of the above properties. Preferably, the HSP used in the 
compositions and methods of the present invention includes, but are not limited to, HSP90, 
gp96, BiP, Hsp70, DnaK, Hsc70, PhoE calreticulin, PDI, or an sHsp, alone or in 
combination. 



- 14- 



NY2- 1119469.2 



In a preferred embodiment, an HSP is a mammalian (e.g., mouse, rat, primate, 
domestic animal such as dog, cat, cow, horse), and is most preferably, human. 

Hsps useful in the practice of the invention include, but are not limited to, members 
of the HSP60 family, HSP70 family, HSP90 family, HSP 100 family, sHSP family, 
5 calreticulin, PDI, and other proteins in the endoplasmic reticulum that contain thioredoxin- 
like domain(s), such as, but not limited to, ERp72 and ERp61 . 

HSP analogs, muteins, derivatives, and fragments can also be used in place of HSPs 
according to the invention. An HSP pepti de-binding "fragment" for use in the invention 
refers to a polypeptide comprising a HSP peptide-binding domain that is capable of 
1Q becoming non-covalently associated with a peptide to form a complex that is capable of 
eliciting an immune response. In one embodiment, an HSP peptide-binding fragment is a 
polypeptide comprising an HSP peptide-binding domain of approximately 100 to 200 amino 
acids. 

Databases can also be searched to identify sequences with various degrees of 

1 5 similarities to a query sequence using programs, such as FASTA and BLAST, which rank the 
similar sequences by alignment scores and statistics. Such nucleotide sequences of non- 
limiting examples of HSPs that can be used for preparation of the HSPs used in the methods 
of the invention are as follows: human Hsp70, Genbank Accession No. NM_005345, 
Sargent etal, 1989, Proc. Natl. Acad. Sci. U.S.A., 86:1968-1972; human Hsp90, Genbank 

20 Accession No. X15183, Yamazaki et al, Nucl. Acids Res. 17:7108; human gp96: Genbank 
Accession No. X15187, Maki et al, 1990, Proc. Natl. Acad Sci., 87: 5658-5562; human BiP: 
Genbank Accession No. Ml 9645; Ting et al, 1988, DNA 7: 275-286; human Hsp27, 
Genbank Accession No. M24743; Hickey et al, 1986, Nucleic Acids Res. 14:4127-45; 
mouse Hsp70: Genbank Accession No. M35021, Hunt et al, 1990, Gene, 87:199-204; 

25 mouse gp96: Genbank Accession No. Ml 6370, Srivastava et al, 1987, Proc. Natl. Acad. 
Sci., 85:3807-381 1; and mouse BiP: Genbank Accession No. U16277, Haas et al, 1988, 
Proc. Natl. Acad. Sci. U.S.A., 85: 2250-2254. Due to the degeneracy of the genetic code, the 
term "HSP sequence", as used herein, refers not only to the naturally occurring amino acid 
and nucleotide sequence but also encompasses all the other degenerate sequences that encode 

30 the HSP. 

The aforementioned HSP families also contain proteins that are related to HSPs in 
sequence, for example, having greater than 35% amino acid identity, but whose expression 
levels are not altered by stress. Therefore, it is contemplated that the definition of heat shock 
or stress protein, as used herein, embraces other proteins, mutants, analogs, and variants 
35 thereof having at least 35% to 55%, preferably 55% to 75%, and most preferably 75% to 

85% amino acid identity with members of these families whose expression levels in a cell are 
enhanced in response to a stressful stimulus. The determination of percent identity between 
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two sequences can also be accomplished using a mathematical algorithm. A preferred, 
non-limiting example of a mathematical algorithm utilized for the comparison of two 
sequences is the algorithm of Karlin and Altschul, 1990, Proc. Natl. Acad. Sci. USA 
87:2264-2268, modified as in Karlin and Altschul, 1993, Proc. Natl. Acad. Sci. USA 

5 90:5873-5877. Such an algorithm is incorporated into the NBLAST and XBLAST programs 
of Altschul, et al., 1990, J. Mol. Biol. 215:403-410. BLAST nucleotide searches can be 
performed with the NBLAST program, score = 100, wordlength = 12 to obtain nucleotide 
sequences homologous to a nucleic acid molecules of the invention. BLAST protein 
searches can be performed with the XBLAST program, score = 50, wordlength = 3 to obtain 

1Q amino acid sequences homologous to a protein molecules of the invention. To obtain gapped 
alignments for comparison purposes, Gapped BLAST can be utilized as described in 
Altschul et al, 1997, Nucleic Acids Res.25:33 89-3402. Alternatively, PSI-Blast can be used 
to perform an iterated search which detects distant relationships between molecules (Altschul 
et al, 1997, supra). When utilizing BLAST, Gapped BLAST, and PSI-Blast programs, the 

15 default parameters of the respective programs {e.g., XBLAST and NBLAST) can be used 
(see http://www.ncbi.nlm.nih.gov). Another preferred, non-limiting example of a 
mathematical algorithm utilized for the comparison of sequences is the algorithm of Myers 
and Miller, 1988, CABIOS 4:1 1-17. Such an algorithm is incorporated into the ALIGN 
program (version 2.0) which is part of the GCG sequence alignment software package. 

20 When utilizing the ALIGN program for comparing amino acid sequences, a PAM120 weight 
residue table, a gap length penalty of 12, and a gap penalty of 4 can be used. 

The immunogenic HSP-peptide complexes of the invention may include any complex 
containing an HSP and a peptide that is capable of inducing an immune response in a 
mammal. The peptides are preferably noncovalently associated with the HSP. Preferred 

25 complexes may include, but are not limited to, gp96-peptide complexes, HSP90-peptide 
complexes, HSP70-peptide complexes, HSP60-peptide complexes, HSPlOO-peptide 
complexes, calreticulin-peptide complexes, and sHSP-peptide complexes. For example, the 
HSP gp96 which is present in the endoplasmic reticulum of eukaryotic cells and is related to 
the cytoplasmic HSP90's can be used to generate an effective vaccine containing a gp96- 

30 peptide complex. 

The HSPs, oc2MR, and/or antigenic molecules for use in the invention can be purified 
from natural sources, chemically synthesized, or recombinantly produced. Although the 
HSPs may be allogeneic to the patient, in a preferred embodiment, the HSPs are autologous 
to the patient to whom they are administered. 
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5.1 



COMPOSITIONS OF THE INVENTION 



The present invention provides compositions that modulate the interaction between 
a2MR and an <x2MR ligand, such as, for example, an HSP. Such compositions can be used 
in methods to elicit or modulate an immune response. Such compositions also include 

5 antibodies that specifically recognize HSP- cc2MR complexes, isolated cells that express 
HSP-a2MR complexes, and isolated and recombinant cells that contain recombinant a2MR 
and HSP sequences. In addition, in various methods of the invention, sequences encoding 
oc2MR, an HSP, and oc2M are used for immunotherapy. Such compositions can be used, for 
example, in immunotherapy against proliferative disorders, infectious diseases, and other 

10 HSP-a2MR-related disorders. Methods for the synthesis and production of such 
compositions are described herein. 

5.1.1 RECOMBINANT EXPRESSION 

In various embodiments of the invention, sequences encoding the a2MR, an HSP, 
15 a2M, or other a2MR ligand are inserted into an expression vector for propagation and 

expression in recombinant cells. Thus, in one embodiment, the a2M receptor, HSP, a2M, or 
other cc2MR ligand coding region is linked to a non-native promoter for expression in 
recombinant cells. 

The amino acid sequence of the portion of a2MR that recognizes and binds to HSPs 

20 is shown in FIG. 6B (SEQ ID NO:2). Based on the discovery by the Applicant, this portion 
of a2MR is responsible for recognizing and binding to HSPs and HSP-antigenic peptide 
complexes. After binding HSPs, a2MR facilitates transport of the HSP-antigenic peptide 
complex into the cell, where the peptide antigens associate with MHC class I molecules and 
are then presented on the cell surface of the cell, and become available to stimulate an 

25 immune response. Based on this invention, compositions comprising agonists and 

antagonists of cc2MR and HSPs interactions can be used to modulate the immune response. 
Thus, recombinant a2MR polypeptides, complexes of ct2MR and an HSP or HSP-antigenic 
peptide complexes, and recombinant cells expressing a2MR or complexes comprising a2MR 
and antigenic peptides can be used in methods for immunotherapy and diagnostic methods 

30 described herein. 

In various embodiments of the invention, sequences encoding the a2MR, and/or a 
heat shock protein or oc2M, or fragments thereof, are inserted into an expression vector for 
propagation and expression in recombinant cells. An expression construct, as used herein, 
refers to a nucleotide sequence encoding a particular gene product, such as the a2MR, HSP 

35 or a2M, operably associated with one or more regulatory regions which allows expression of 
the encoded gene product in an appropriate host cell. "Operably-associated" refers to an 
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association in which the regulatory regions and the nucleotide sequence encoding the gene 
product to be expressed are joined and positioned in such a way as to permit transcription, 
and ultimately, translation. 

The DNA may be obtained from known sequences derived from sequence databases 

5 by standard procedures known in the art by DNA amplification or molecular cloning directly 
from a tissue, cell culture, or cloned DNA (e.g., a DNA "library"). Any eukaryotic cell may 
serve as the nucleic acid source for obtaining the coding region of an hsp gene. Nucleic acid 
sequences encoding HSPs can be isolated from vertebrate, mammalian, as well as primate 
sources, including humans. Clones derived from genomic DNA may contain regulatory and 

10 intron DNA regions in addition to coding regions; clones derived from cDNA will contain 
only exon sequences. Whatever the source, the hsp gene should be cloned into a suitable 
vector for propagation of the gene. 

Vectors based on E. coli are the most popular and versatile systems for high level 
expression of foreign proteins (Makrides, 1996, Microbiol Rev, 60:512-538). Non-limiting 

1 5 examples of regulatory regions that can be used for expression in E. coli may include but not 
limited to lac, trp, Ipp, phoA, recA, tac, XP L and phage T3 and T7 promoters (Makrides, 
1996, Microbiol Rev, 60:512-538). Non-limiting examples of prokaryotic expression vectors 
may include the Xgt vector series such as Xgtl 1 (Huynh et al., 1984 in "DNA Cloning 
Techniques", Vol. I: A Practical Approach (D. Glover, ed.), pp. 49-78, IRL Press, Oxford), 

20 and the pET vector series (Studier et al., 1990, Methods Enzymol., 185:60-89). However, a 
potential drawback of a prokaryotic host- vector system is the inability to perform many of 
the post-translational processing events of mammalian cells. Thus, an eukaryotic host-vector 
system is preferred, a mammalian host-vector system is more preferred, and a human host- 
vector system is the most preferred. 

25 The regulatory regions necessary for transcription of an oc2MR sequence, for 

example, can be provided by the expression vector. A translation initiation codon (ATG) 
may also be provided to express a nucleotide sequence encoding an a2M receptor that lacks 
an initiation codon. In a compatible host-construct system, cellular proteins required for 
transcription, such as RNA polymerase and transcription factors, will bind to the regulatory 

30 regions on the expression construct to effect transcription of the oc2MR sequence in the host 
organism. The precise nature of the regulatory regions needed for gene expression may vary 
from host cell to host cell. Generally, a promoter is required which is capable of binding 
RNA polymerase to initiate the transcription of an operably- associated nucleic acid 
sequence. Such regulatory regions may include those 5'-non-coding sequences involved with 

35 initiation of transcription and translation, such as the TATA box, the cap site, a CAAT box, 
and the like. The non-coding region 3' to the coding sequence may contain transcriptional 
termination regulatory sequences, such as terminators and polyadenylation sites. 
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Both constitutive and inducible regulatory regions may be used for expression of the 
a2M receptor, HSP, cc2M, or other cc2MR ligand. It may be desirable to use inducible 
promoters when the conditions optimal for growth of the recombinant cells and the 
conditions for high level expression of the gene product are different. Examples of useful 

5 regulatory regions are provided in the next section below. 

For expression of the cc2M receptor, HSP, a2M, or other a2MR ligand gene product 
in mammalian host cells, a variety of regulatory regions can be used, for example, the SV40 
early and late promoters, the cytomegalovirus (CMV) immediate early promoter, and the 
Rous sarcoma virus long terminal repeat (RS V-LTR) promoter. Inducible promoters that 

10 may be useful in mammalian cells include but are not limited to those associated with the 
metallothionein II gene, mouse mammary tumor virus glucocorticoid responsive long 
terminal repeats (MMTV-LTR), the p-interferon gene, and the Hsp70 gene (Williams et al, 
1989, Cancer Res. 49:2735-42; Taylor et al, 1990, Mol. Cell Biol., 10:165-75). It may be 
advantageous to use heat shock promoters or stress promoters to drive expression of ct2MR 

1 5 in recombinant host cells. 

The following animal regulatory regions, which exhibit tissue specificity and have 
been utilized in transgenic animals, can also be used in tumor cells of a particular tissue type: 
elastase I gene control region which is active in pancreatic acinar cells (Swift et al., 1984, 
Cell 38:639-646; Ornitz et al., 1986, Cold Spring Harbor Symp. Quant. Biol. 50:399-409; 

20 MacDonald, 1987, Hepatology 7:425-515); insulin gene control region which is active in 
pancreatic beta cells (Hanahan, 1985, Nature 315:1 15-122), immunoglobulin gene control 
region which is active in lymphoid cells (Grosschedl et al., 1984, Cell 38:647-658; Adames 
et al, 1985, Nature 318:533-538; Alexander et al, 1987, Mol. Cell. Biol. 7:1436-1444), 
mouse mammary tumor virus control region which is active in testicular, breast, lymphoid 

25 and mast cells (Leder et al., 1986, Cell 45:485-495), albumin gene control region which is 
active in liver (Pinkert et al., 1987, Genes and Devel. 1:268-276), alpha-fetoprotein gene 
control region which is active in liver (Krumlauf et al., 1985, Mol. Cell. Biol. 5:1639-1648; 
Hammer et al., 1987, Science 235:53-58; alpha 1-antitrypsin gene control region which is 
active in the liver (Kelsey et al., 1987, Genes and Devel. 1:161-171), beta-globin gene 

30 control region which is active in myeloid cells (Mogram et al., 1985, Nature 315:338-340; 
Kollias et al., 1986, Cell 46:89-94; myelin basic protein gene control region which is active 
in oligodendrocyte cells in the brain (Readhead et al, 1987, Cell 48:703-712); myosin light 
chain-2 gene control region which is active in skeletal muscle (Sani, 1985, Nature 314:283- 
286), and gonadotropic releasing hormone gene control region which is active in the 

35 hypothalamus (Mason et al., 1986, Science 234:1372-1378). 

The efficiency of expression of the oc2M receptor in a host cell may be enhanced by 
the inclusion of appropriate transcription enhancer elements in the expression vector, such as 
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those found in SV40 virus, Hepatitis B virus, cytomegalovirus, immunoglobulin genes, 
metallothionein, [3-actin (see Bittner et al, 1987, Methods in Enzymol. 153:516-544; 
Gorman, 1990, Curr. Op. in Biotechnol. 1:36-47). 

The expression vector may also contain sequences that permit maintenance and 
5 replication of the vector in more than one type of host cell, or integration of the vector into 
the host chromosome. Such sequences may include but are not limited to replication origins, 
autonomously replicating sequences (ARS), centromere DNA, and telomere DNA. It may 
also be advantageous to use shuttle vectors that can be replicated and maintained in at least 
two types of host cells. 

1 0 In addition, the expression vector may contain selectable or screenable marker genes 

for initially isolating or identifying host cells that contain DNA encoding an a2M receptor. 
For long term, high yield production of a2M receptor, stable expression in mammalian cells 
is preferred. A number of selection systems may be used for mammalian cells, including, but 
not limited, to the Herpes simplex virus thymidine kinase (Wigler et al., 1977, Cell 1 1 :223), 

25 hypoxanthine-guanine phosphoribosyltransferase (Szybalski and Szybalski, 1962, Proc. Natl. 
Acad. Sci. USA 48:2026), and adenine phosphoribosyltransferase (Lowy et al., 1980, Cell 
22:817) genes can be employed in tk~, hgprf or aprf cells, respectively. Also, antimetabolite 
resistance can be used as the basis of selection for dihydrofolate reductase (dhfr), which 
confers resistance to methotrexate (Wigler et al., 1980, Natl. Acad. Sci. USA 77:3567; 

20 O'Hare et al., 1981, Proc. Natl. Acad. Sci. USA 78:1527); gpt, which confers resistance to 
mycophenolic acid (Mulligan & Berg, 1981, Proc. Natl. Acad. Sci. USA 78:2072); neomycin 
phosphotransferase (neo), which confers resistance to the aminoglycoside G-418 (Colberre- 
Garapin et al., 1981, J. Mol. Biol. 150:1); and hygromycin phosphotransferase (hyg), which 
confers resistance to hygromycin (Santerre et al., 1984, Gene 30:147). Other selectable 

25 markers, such as but not limited to histidinol and Zeocin™ can also be used. 

In order to insert the DNA sequence encoding a2M receptor, HSP, ct2M, or other 
a2MR ligand into the cloning site of a vector, DNA sequences with regulatory functions, 
such as promoters, must be attached to DNA sequences encoding the a2M receptor, HSP, 
a2M, or other oc2MR ligand, respectively. To do this, linkers or adapters providing the 

30 appropriate compatible restriction sites may be ligated to the ends of cDNA or synthetic 
DNA encoding an a2M receptor, by techniques well known in the art (Wu et al., 1987, 
Methods in Enzymol 152:343-349). Cleavage with a restriction enzyme can be followed by 
modification to create blunt ends by digesting back or filling in single-stranded DNA termini 
before ligation. Alternatively, a desired restriction enzyme site can be introduced into a 

35 fragment of DNA by amplification of the DNA by use of PCR with primers containing the 
desired restriction enzyme site. 
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In one embodiment, an expression construct comprising an cc2M receptor sequence 
operably associated with regulatory regions can be directly introduced into appropriate host 
cells for expression and production of cc2MR without further cloning (see, for example, U.S. 
Patent No. 5,580,859). The expression constructs may also contain DNA sequences that 

5 facilitate integration of the a2M receptor sequence into the genome of the host cell, e.g., via 
homologous recombination. In this instance, it is not necessary to employ an expression 
vector comprising a replication origin suitable for appropriate host cells in order to propagate 
and express the u2M receptor in the host cells. 

Expression constructs containing cloned nucleotide sequence encoding the cx2M 

IQ receptor, an HSP, a2M, or other oc2MR ligand, can be introduced into the host cell by a 
variety of techniques known in the art, including but not limited to, for prokaryotic cells, 
bacterial transformation (Hanahan, 1985, in DNA Cloning, A Practical Approach, 1:109- 
136), and for eukaryotic cells, calcium phosphate mediated transfection (Wigler et al., 1977, 
Cell 1 1:223-232), liposome-mediated transfection (Schaefer-Ridder et al., 1982, Science 

15 215:166-168), electroporation (Wolff et al., 1987, Proc Natl Acad Sci 84:3344), and 
microinjection (Cappechi, 1980, Cell 22:479-488). 

For long term, high yield production of properly processed cc2M receptor, HSP, oc2M, 
or other a2MR ligand, stable expression in mammalian cells is preferred. Cell lines that 
stably express the a2M receptor, HSP, ct2M, or other a2MR ligand or a2MR-peptide 

2Q complexes may be engineered by using a vector that contains a selectable marker. By way of 
example but not limitation, following the introduction of the expression constructs, 
engineered cells may be allowed to grow for 1-2 days in an enriched media, and then are 
switched to a selective media. The selectable marker in the expression construct confers 
resistance to the selection and optimally allows cells to stably integrate the expression 

25 construct into their chromosomes and to grow in culture and to be expanded into cell lines. 
Such cells can be cultured for a long period of time while the desired gene product is 
expressed continuously. 

The recombinant cells may be cultured under standard conditions of temperature, 
incubation time, optical density, and media composition. Alternatively, recombinant 

30 antigenic cells may be cultured under conditions emulating the nutritional and physiological 
requirements of the cancer cell or infected cell. However, conditions for growth of 
recombinant cells may be different from those for expression of the cc2M receptor, HSPs, 
a2M, or other a2MR ligand, or antigenic peptide. 

35 
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5. 1 .2 PEPTIDE SYNTHESIS 



An alternative to producing peptides and polypeptides comprising HSP, a2M 
receptor, oc2M or other oc2MR ligand sequences, by recombinant techniques is peptide 
synthesis. For example, a peptide corresponding to a portion of an HSP or an <x2M peptide 

5 comprising the receptor-binding domain, which can be used as an antagonist in the 

therapeutic methods described herein, can be synthesized by use of a peptide synthesizer. 
Synthetic peptides corresponding to oc2M receptor sequences useful for therapeutic methods 
described herein can also be produced synthetically. Conventional peptide synthesis may be 
used or other synthetic protocols well known in the art. 

10 For example, peptides having the amino acid sequence of the a2M receptor, an HSP, 

oc2M, or other cc2MR ligand, or an analog, mutein, fragment, or derivative thereof, may be 
synthesized by solid-phase peptide synthesis using procedures similar to those described by 
Merrifield, 1963, J. Am. Chem. Soc, 85:2149. During synthesis, N-cc-protected amino acids 
having protected side chains are added stepwise to a growing polypeptide chain linked by its 

15 C-terminal and to an insoluble polymeric support i.e., polystyrene beads. The peptides are 
synthesized by linking an amino group of an N-a-deprotected amino acid to an a-carboxyl 
group of an N-oc -protected amino acid that has been activated by reacting it with a reagent 
such as dicyclohexylcarbodiimide. The attachment of a free amino group to the activated 
carboxyl leads to peptide bond formation. The most commonly used N-a-protecting groups 

20 include Boc which is acid labile and Fmoc which is base labile. Details of appropriate 

chemistries, resins, protecting groups, protected amino acids and reagents are well known in 
the art and so are not discussed in detail herein (See, Atherton, et ah, 1989, Solid Phase 
Peptide Synthesis: A Practical Approach, IRL Press, and Bodanszky, 1993, Peptide 
Chemistry, A Practical Textbook, 2nd Ed., Springer- Verlag). 

25 Purification of the resulting a2M receptor, HSP, a2M, or other oc2MR ligand 

peptides is accomplished using conventional procedures, such as preparative HPLC using gel 
permeation, partition and/or ion exchange chromatography. The choice of appropriate 
matrices and buffers are well known in the art and so are not described in detail herein. 

In addition, analogs and derivatives of <x2M receptor, HSP, oc2M, or other a2MR 

30 ligand protein can be chemically synthesized. Furthermore, if desired, nonclassical amino 
acids or chemical amino acid analogs can be introduced as a substitution or addition into the 
oc2M receptor, HSP, a2M, or other cc2MR ligand sequence. Non-classical amino acids 
include but are not limited to the D-isomers of the common amino acids, oc-amino isobutyric 
acid, 4-aminobutyric acid, Abu, 2-amino butyric acid, y-Abu, e-Ahx, 6-amino hexanoic acid, 

35 Aib, 2-amino isobutyric acid, 3 -amino propionic acid, ornithine, norleucine, norvaline, 
hydroxyproline, sarcosine, citrulline, cysteic acid, t-butylglycine, t-butylalanine, 
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phenylglycine, cyclohexylalanine, |3-alanine, fluoro-amino acids, designer amino acids such 
as P-methyl amino acids, Ca-methyl amino acids, Na-methyl amino acids, and amino acid 
analogs in general. 

5 5. 1 .3 ANTIBODIES SPECIFIC FOR a2M RECEPTOR-HSP COMPLEXES 

Described herein are methods for the production of antibodies capable of specifically 
recognizing cc2M receptor epitopes, HSP-a2M receptor complex epitopes or epitopes of 
conserved variants or peptide fragments of the receptor or receptor complexes. Such 
antibodies are useful for therapeutic and diagnostic methods of the invention. 

1 0 Such antibodies may include, but are not limited to, polyclonal antibodies, 

monoclonal antibodies (mAbs), humanized or chimeric antibodies, single chain antibodies, 
Fab fragments, F(ab') 2 fragments, fragments produced by a Fab expression library, anti- 
idiotypic (anti-Id) antibodies, and epitope-binding fragments of any of the above. Such 
antibodies may be used, for example, in the detection of an a2M receptor or HSP-a2M 

1 5 receptor complex in an biological sample. Such antibodies may also be utilized in 

conjunction with, for example, compound screening schemes, as described below, in Section 
5.2, for the evaluation of the effect of test compounds on the interaction between HSPs and 
the a2M receptor. 

Anti-a2M receptor complex antibodies may additionally be used as a method for the 
20 inhibition of abnormal receptor product activity. Thus, such antibodies may, be utilized as 
part of treatment methods for HSP-cc2M receptor related disorders, e.g., autoimmune 
disorders. 

For the production of antibodies against cc2M receptor or receptor complexes, various 
host animals may be immunized by injection with an oc2M receptor or HSP-oc2M receptor 
25 complex, or a portion thereof. An antigenic portion of a2M receptor or HSP-a2M receptor 
complex can be readily predicted by algorithms known in the art. 

Host animals may include, but are not limited to rabbits, mice, and rats, to name but a 
few. Various adjuvants may be used to increase the immunological response, depending on 
the host species, including but not limited to Freund's (complete and incomplete), mineral 
30 gels such as aluminum hydroxide, surface active substances such as lysolecithin, pluronic 
polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanin, dinitrophenol, and 
potentially useful human adjuvants such as BCG (bacille Calmette-Guerin) and 
Corynebacterium parvutn. 

Polyclonal antibodies are heterogeneous populations of antibody molecules derived 
35 from the sera of animals immunized with an antigen, such as an a2M receptor or HSP-oc2M 
receptor complex, or an antigenic functional derivative thereof. For the production of 



-23 - 



NY2- 1119469.2 



polyclonal antibodies, host animals such as those described above, may be immunized by 
injection with a2M receptor or HSP-oc2M receptor complex, or portion thereof, 
supplemented with adjuvants as also described above. 

Monoclonal antibodies, which are homogeneous populations of antibodies to a 

5 particular antigen, may be obtained by any technique that provides for the production of 
antibody molecules by continuous cell lines in culture. These include, but are not limited to, 
the hybridoma technique of Kohler and Milstein, (1975, Nature 256, 495-497; and U.S. 
Patent No. 4,376,1 10), the human B-cell hybridoma technique (Kosbor et al, 1983, 
Immunology Today 4: 72; Cole et al, 1983, Proc. Natl. Acad. Sci. USA 80, 2026-2030), and 

j0 the EBV-hybridoma technique (Cole et al, 1985, Monoclonal Antibodies And Cancer 
Therapy, Alan R. Liss, Inc., pp. 77-96). Such antibodies may be of any immunoglobulin 
class including IgG, IgM, IgE, IgA, IgD and any subclass thereof. The hybridoma producing 
the mAb of this invention may be cultivated in vitro or in vivo. Production of high titers of 
mAbs in vivo makes this the presently preferred method of production. 

15 In addition, techniques developed for the production of "chimeric antibodies" 

(Morrison, etal, 1984, Proc. Natl. Acad. Sci., 81: 6851-6855; Neuberger, et al, 1984, 
Nature 312: 604-608; Takeda, et al, 1985, Nature, 314: 452-454) by splicing the genes from 
a mouse antibody molecule of appropriate antigen specificity together with genes from a 
human antibody molecule of appropriate biological activity can be used. A chimeric 

20 antibody is a molecule in which different portions are derived from different animal species, 
such as those having a variable region derived from a murine mAb and a human 
immunoglobulin constant region (see, e.g., Cabilly et al, U.S. Patent No. 4,816,567; and 
Boss et al, U.S. Patent No. 4,816397, which are incorporated herein by reference in their 
entirety). 

25 In an additional embodiment of the invention, monoclonal antibodies can be 

produced in germ- free animals (see PCT International Publication No. WO 89/12690, 
published December 12, 1989). According to the invention, human antibodies may be used 
and can be obtained by using human hybridomas (Cote et al, 1983, Proc. Natl. Acad. Sci. 
U.S.A. 80:2026-2030) or by transforming human B cells with EBV virus in vitro (Cole et al, 

30 1985, in Monoclonal Antibodies and Cancer Therapy , Alan R. Liss, pp. 77-96). Techniques 
developed for the production of "chimeric antibodies" (Morrison et al, 1984, Proc. Natl. 
Acad. Sci. U.S.A. 81:6851-6855; Neuberger et al, 1984, Nature 312:604-608; Takeda et al, 
1985, Nature 314:452-454) by splicing the genes from a mouse antibody molecule specific 
for an a2M receptor-HSP complex together with genes from a human antibody molecule of 

35 appropriate biological activity can also be used; such antibodies are within the scope of this 
invention. 
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Humanized antibodies are also provided (see U.S. Patent No. 5,225,539 by Winter). 
An immunoglobuin light or heavy chain variable region consists of a "framework" region 
interrupted by three hypervariable regions, referred to as complementarity determining 
regions (CDRs). The extent of the framework region and CDRs have been precisely defined 

5 (see, "Sequences of Proteins of Immunological Interest", Kabat, E. et al, U.S. Department of 
Health and Human Services (1983). Briefly, humanized antibodies are antibody molecules 
from non-human species having one or more CDRs from the non-human species and a 
framework region from a human immunoglobulin molecule. Such CDRS -grafted antibodies 
have been successfully constructed against various antigens, for example, antibodies against 

10 IL-2 receptor as described in Queen et al, 1989, Proc. Natl. Acad. Sci. USA 86:10029; 
antibodies against the cell surface receptor CAMPATH as described in Riechmann et al. , 
1988, Nature 332:323; antibodies against hepatitis B in Co et al, 1991, Proc. Natl. Acad. Sci. 
USA 88:2869; as well as against viral antigens of the respiratory syncytial virus in Tempest 
et al, 1991, Bio-Technology 9:267. Humanized antibodies are most preferred for 

1 5 therapeutic use in humans. 

Alternatively, techniques described for the production of single chain antibodies 
(U.S. Patent 4,946,778; Bird, 1988, Science 242: 423-426; Huston et al, 1988, Proc. Natl. 
Acad. Sci. USA 85: 5879-5883; and Ward et al, 1989, Nature 334: 544-546) can be adapted 
to produce single chain antibodies against a2M receptor or HSP-a2M receptor complexes, or 

20 portions thereof. Single chain antibodies are formed by linking the heavy and light chain 
fragments of the Fv region via an amino acid bridge, resulting in a single chain polypeptide. 

Antibody fragments that recognize specific epitopes may be generated by known 
techniques. For example, such fragments include but are not limited to: the F(ab') 2 
fragments, which can be produced by pepsin digestion of the antibody molecule and the Fab 

25 fragments, which can be generated by reducing the disulfide bridges of the F(ab') 2 fragments. 
Alternatively, Fab expression libraries may be constructed (Huse et al, 1989, Science, 246: 
1275-1281) to allow rapid and easy identification of monoclonal Fab fragments with the 
desired specificity. 

Antibodies to the cc2M receptor can, in turn, be utilized to generate anti-idiotype 
30 antibodies that "mimic" the a2M receptor, using techniques well known to those skilled in 
the art (see, e.g., Greenspan & Bona, 1993, FASEB J 7(5):437-444; and Nissinoff, 1991, J. 
Immunol. 147(8):2429-2438). For example antibodies which bind to the ct2M receptor ECD 
and competitively inhibit the binding of HSPs to the a2M receptor can be used to generate 
anti-idiotypes that "mimic" the ECD and, therefore, bind and neutralize HSPs. Such 
35 neutralizing anti-idiotypes or Fab fragments of such anti-idiotypes can be used in therapeutic 
regimens to neutralize the native ligand and treat HSP-cc2M receptor-related disorders, such 
as immunological disorders, proliferative disorders, and infectious diseases. 
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Alternatively, antibodies to the cc2M receptor that can act as agonists of the cc2M 
receptor activity can be generated. Such antibodies will bind to the cc2M receptor and 
activate the signal transducing activity of the receptor. In addition, antibodies that act as 
antagonist of the a2M receptor activity, i.e. inhibit the activation of the cc2M receptor would 
5 be particularly useful for treating autoimmune disorders, proliferative disorders, such as 
cancer, and infectious diseases. Methods for assaying for such agonists and antagonists are 
described in detail in Section 5.2, below. 



5.2 ASSAYS FOR THE IDENTIFICATION OF COMPOUNDS THAT INTERACT 
1 0 WITH THE a2M RECEPTOR 

The present invention is based on the discovery that the oc2M receptor recognizes 
HSP-antigenic peptide complexes and transports them within the cell for the purpose of 
presenting such antigenic molecules to cells of the immune system and eliciting an immune 
response. Thus, methods for identifying compounds that interact with the receptor, or . 
enhance or block the function of the receptor, are included in the invention. The present 
invention provides in vitro and in vivo assay systems, described in the subsections below, 
which can be used to identify compounds or compositions that interact with the a2M 
receptor, or modulate the activity of the a2M receptor and its interaction with HSPs or HSP- 
peptide complexes. 

The invention provides screening methodologies useful in the identification of small 
molecules, proteins and other compounds which interact with the cc2M receptor, or modulate 
the interaction of HSPs with the a2M receptor. Such compounds may bind the a2M 
receptor genes or gene products with differing affinities, and may serve as regulators of 
receptor activity in vivo with useful therapeutic applications in modulating the immune 
response. For example, certain compounds that inhibit receptor function may be used in 
patients to downregulate destructive immune responses which are caused by cellular release 
of HSPs. 

Methods to screen potential agents for their ability to interact with the ct2M receptor, 
or modulate a2M receptor expression and activity can be designed based on the inventor's 
discovery of the receptor and its role in HSP or HSP-peptide complex binding and 
recognition. a2M receptor protein, nucleic acids, and derivatives can be used in screening 
assays to detect molecules that specifically bind to HSP proteins, derivatives, or nucleic 
acids, and thus have potential use as agonists or antagonists of the a2M receptor, to modulate 
the immune response. In a preferred embodiment, such assays are performed to screen for 
molecules with potential utility as anti-autoimmune disease, anti-cancer and anti-infective 
drugs (such as anti-viral drugs and antibiotic drugs), or lead compounds for drug 
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development. For example, recombinant cells expressing a2M receptor nucleic acids can be 
used to recombinantly produce a2M receptor in these assays, to screen for molecules that 
interfere with the binding of HSPs to the a2M receptor. Similar methods can be used to 
screen for molecules that bind to the oc2M receptor derivatives or nucleic acids. Methods that 

5 can be used to carry out the foregoing are commonly known in the art. 

Compounds capable of specifically binding the a2M receptor can be useful for 
immunotherapy. In one embodiment, an assay is disclosed for identifying compounds that 
specifically bind the a2M receptor comprising: (a) contacting an cc2M receptor with one or 
more test compounds under conditions conducive to binding; and (b) identifying one or more 

IQ test compounds which specifically bind to the a2M receptor, such that a compound capable 
of specifically binding the <x2M receptor is identified as a compound useful for 
immunotherapy. 

Another method encompassed by the invention for identifying a compound useful for 
immunotherapy involves identifying a compound which modulates the binding of an cc2M 

2 5 receptor ligand to the a2M receptor. The term "a2M receptor ligand" as used herein, refers 
to an molecule capable of binding to the a2M receptor. Such a2M receptor ligands include, 
but are not limited to, oc2M and a2M complexes, heat shock proteins and heat shock protein 
complexes, lipoprotein complexes, lactoferrin, tissue-type plasminogen activator (tPA), 
urokinase-type plasminogen activator (uPA), and exotoxins. Such ligands are typically 

20 endocytosed by cell upon binding to the cc2M receptor. The method comprises the steps of: 
(a) contacting an a2M receptor with an a2M receptor ligand, or fragment, or analog, 
derivative or mimetic thereof, in the presence of one or more test compound; and (b) 
measuring the amount of a2M receptor ligand, or fragment, analog, derivative or mimetic 
thereof, bound to the <x2M receptor, such that if the amount of bound cc2M receptor ligand 

25 measured in (b) differs from the amount of bound a2M receptor measured in the absence of 
the test compound, then a compound useful for immunotherapy that modulates the binding of 
an a2M receptor ligand to the a2M receptor is identified. 

In another embodiment, a method for identifying a compound useful for 
immunotherapy which modulates the interaction between the a2M receptor and an <x2M 

30 receptor ligand is provided by the invention. This method comprises the steps of: (a) 

contacting an cc2M receptor with one or more test compounds; and (b) measuring the level of 
cc2M receptor activity or expression, such that if the level of activity or expression measured 
in (b) differs from the level of a2M receptor activity in the absence of one or more test 
compounds, then a compound that modulates the interaction between the cc2M receptor and 

35 an a2M receptor ligand is identified. 

In another embodiment, an assay for identifying a compound that modulates an HSP- 
a2M receptor-mediated process is disclosed. This assay comprises: (a) contacting a test 
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compound with an HSP and an a2M receptor; and (b) measuring the level of a2M receptor 
activity or expression, such that if the level of activity or expression measured in (b) differs 
from the level of a2M receptor activity in the absence of the test compound, then a 
compound that modulates an HSP-a2M receptor-mediated process is identified. In another 
5 embodiment, in which the compound identified is an antagonist which interferes with the 
interaction of the HSP with the a2M receptor, the method further comprises the step of 
determining whether the level interferes with the interaction of the HSP and the oc2M 
receptor. 

In another embodiment, a cell-based method for identifying a compound that 

10 modulates an HSP-cc2M receptor-mediated process is described. This method comprises the 
following steps: (a) contacting a test compound with a heat shock protein and an oc2M 
receptor-expressing cell; and (b) measuring the level of a2M receptor activity or expression 
in the cell, such that if the level of activity or expression measured in (b) differs from the 
level of a2M receptor activity in the absence of the test compound, then a compound that 

15 modulates an HSP-a2M receptor-mediated process is identified. 

In another embodiment, a receptor-ligand binding assay for identifying a compound 
that interacts with a2MR, or modulates the binding of an HSP to a2MR. One such method 
comprises: (a) contacting an HSP with an a2M receptor, or fragment, or analog, derivative 
or mimetic thereof, in the presence of a test compound; and (b) measuring the amount of heat 

20 shock protein bound to the a2M receptor, or fragment, analog, derivative or mimetic thereof, 
such that if the amount of bound heat shock protein measured in (b) differs from the amount 
of bound heat shock protein measured in the absence of the test compound, then a compound 
that modulates the binding of an HSP to the a2M receptor is identified. 

In another embodiment, a method for identifying a compound that modulates antigen 

25 presentation by a2MR-expressing cells is provided by the invention. In one embodiment, 
such a method comprises: (a) adding one or more test compounds to a mixture of a2MR- 
expressing cells and a complex comprising an oc2MR ligand and an antigenic molecule, 
under conditions conducive to a2MR-mediated endocytosis; (2) measuring the level of 
stimulation of antigen-specific cytotoxic T cells by the a2MR- expressing cells, such that if 

30 the level measured in (b) differs from the level of said stimulation in the absence of the one 
or more test compounds, then a compound that modulates antigen presentation by a2MR- 
expressing cells is identified. In another embodiment, a test compound is added to a mixture 
of a2MR-expressing cells and a complex consisting essentially of an HSP noncovalently 
associated with an antigenic molecule, under conditions conducive to <x2MR-mediated 

35 endocytosis; and the level of stimulation of antigen-specific cytotoxic T cells by the a2MR- 
expressing cells is measured, such that if the level measured differs from the level of said 
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stimulation in the absence of the test compound, then a compound that modulates HSP- 
mediated antigen presentation by a2MR-expressing cells is identified. 

The assays of the present invention may be first optimized on a small scale (i.e., in 
test tubes), and then scaled up for high-throughput assays. In various embodiments, the in 
5 vitro screening assays of the present invention may be performed using purified components 
or cell lysates. In other embodiments, the screening assays may be carried out in intact cells 
in culture and in animal models. In accordance with the present invention, test compounds 
which are shown to modulate the activity of the a2M receptor as described herein in vitro, 
will further be assayed in vivo, including cultured cells and animal models to determine if the 
test compound has the similar effects in vivo and to determine the effects of the test 
compound on antigen presentation, cytokine release, intracellular Ca^ release, T-cell 
cytotoxicity, tumor progression, the accumulation or degradation of positive and negative 
regulators, cellular proliferation, etc. 

! 5 5.2. 1 a2M RECEPTOR- LIGAND BINDING ASSAYS 

The screening assays, described herein, can be used to identify compounds and 
compositions, including peptides and organic, non-protein molecules that interact with the 
a2M receptor, or that modulate the interaction between HSPs and the oc2M receptor. 
Recombinant, synthetic, and otherwise exogenous compounds may have binding capacity 

20 and, therefore, may be candidates for pharmaceutical agents. Alternatively, the proteins and 
compounds include endogenous cellular components which interact with the identified genes 
and proteins in vivo. Such endogenous components may provide new targets for 
pharmaceutical and therapeutic interventions. 

Thus, in a preferred embodiment, both naturally occurring and/or synthetic 

25 compounds (e.g. , libraries of small molecules or peptides), may be screened for interacting 
with a2M receptor and/or modulating a2M receptor activity. In another series of 
embodiments, cell lysates or tissue homogenates may be screened for proteins or other 
compounds which bind to one of the normal or mutant oc2M receptor genes and a2M 
receptor polypeptides. 

30 The screening assays described herein may be used to identify small molecules, 

peptides or proteins, or derivatives, analogs and fragments thereof, that interact with and/or 
modulate the interaction of HSPs with the a2M receptor. Such compounds may be used as 
agonists or antagonists of the uptake of a2M receptor ligands, such as HSPs and HSP 
complexes, by the cell surface receptor. For example, compounds that modulate the a2M 

35 receptor-ligand interaction include, but are not limited to, compounds that bind to the a2M 
receptor, thereby either inhibiting (antagonists) or enhancing (agonists) the binding of 
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ligands, such as HSPs and HSP complexes, to the receptor, as well as compounds that bind to 
the ligand, such as for example, HSPs, thereby preventing or enhancing binding of ligand to 
the receptor. Compounds that affect a2M receptor gene activity (by affecting a2M receptor 
gene expression, including molecules, e.g., proteins or small organic molecules, that affect 

5 transcription or interfere with splicing events so that expression of the full length or truncated 
forms of a2M receptor can be modulated) can also be identified in the screens of the 
invention. Further, it should be noted that the assays described can also identify compounds 
that modulate a2M receptor ligand, for example HSP, uptake by cc2M receptor (e.g., 
compounds which affect downstream signaling in the a2M receptor signal transduction 

2Q pathway). The identification and use of such compounds which affect signaling events 
downstream of the <x2M receptor and thus modulate effects of the receptor on the immune 
response are within the scope of the invention. 

Compounds that affect the cc2M receptor gene activity (by affecting the ce2M receptor 
gene expression, including molecules, e.g., proteins or small organic molecules, that affect 

25 transcription or interfere with splicing events so that expression of the full length or the 

truncated form of the a2M receptor can be modulated) can also be identified in the screens of 
the invention. However, it should be noted that the assays described can also identify 
compounds that modulate the a2M receptor signal transduction (e.g., compounds which 
affect downstream signaling events, such as inhibitors or enhancers of endocytic activity 

20 which is activated by ligand binding to the cc2M receptor). The identification and use of such 
compounds which affect signaling events downstream of the cc2M receptor and thus 
modulate effects of the a2M receptor on the allergenic response are within the scope of the 
invention. 

The screening assays described herein are designed to detect compounds that 
25 modulate, i.e. interfere with or enhance, ligand-receptor interactions, including HSP-oc2M 
receptor interactions. As described in detail below, such assays are functional assays, such as 
binding assays, that can be adapted to a high-throughput screening methodologies. 

Binding assays can be used to identify compounds that modulate the interaction 
between ligands, for example, HSPs, and the a2M receptor. In one aspect of the invention 
30 the screens may be designed to identify compounds that disrupt the interaction between the 
a2M receptor and a ligand, such as, for example, HSPs or peptides derived from an HSP, 
a2M, or another a2M receptor ligand. Such compounds will be useful as lead compounds 
for antagonists of HSP-a2M receptor-related disorders and conditions, such as immune 
disorders, proliferative disorders, and infectious diseases. 
35 Binding assays may be performed either as direct binding assays or as competition 

binding assays. In a direct binding assay, a test compound is tested for binding either to the 
a2M receptor or to an a2M receptor ligand, such as an HSP. Then, in a second step, the test 
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compound is tested for its ability to modulate the ligand-a2M receptor interaction. 
Competition binding assays, on the other hand, assess the ability of a test compound to 
compete with a ligand, i.e. an HSP, for binding to the cc2M receptor. 

In a direct binding assay, either the ligand and/or the oc2M receptor is contacted with 

5 a test compound under conditions that allow binding of the test compound to the ligand or 
the receptor. The binding may take place in solution or on a solid surface. Preferably, the 
test compound is previously labeled for detection. Any detectable compound may be used 
for labeling, such as but not limited to, a luminescent, fluorescent, or radioactive isotope or 
group containing same, or a nonisotopic label, such as an enzyme or dye. After a period of 

lo incubation sufficient for binding to take place, the reaction is exposed to conditions and 
manipulations that remove excess or non-specifically bound test compound. Typically, it 
involves washing with an appropriate buffer. Finally, the presence of a ligand-test 
compound {e.g., HSP-test compound) or a the a2M receptor-test compound complex is 
detected. 

15 In a competition binding assay, test compounds are assayed for their ability to disrupt 

or enhance the binding of the ligand {e.g., HSP) to the a2M receptor. Labeled ligand (e.g., 
HSP) may be mixed with the a2M receptor or fragment or derivative thereof, and placed 
under conditions in which the interaction between them would normally occur, with and 
without the addition of the test compound. The amount of labeled ligand (e.g., HSP) that 

20 binds the a2M receptor may be compared to the amount bound in the presence or absence of 
test compound. 

In a preferred embodiment, to facilitate complex formation and detection, the binding 
assay is carried out with one or more components immoblilized on a solid surface. In 
various embodiments, the solid support could be, but is not restricted to, polycarbonate, 

25 polystyrene, polypropylene, polyethlene, glass, nitrocellulose, dextran, nylon, 

polyacrylamide and agarose. The support configuration can include beads, membranes, 
microparticles, the interior surface of a reaction vessel such as a microtiter plate, test tube or 
other reaction vessel. The immobilization of the a2M receptor, or other component, can be 
achieved through covalent or non-covalent attachments. In one embodiment, the attachment 

30 may be indirect, i.e. through an attached antibody. In another embodiment, the cc2M receptor 
and negative controls are tagged with an epitope, such as glutathione S-transferase (GST) so 
that the attachment to the solid surface can be mediated by a commercially available 
antibody such as anti-GST (Santa Cruz Biotechnology). 

For example, such an affinity binding assay may be performed using a the a2M 

35 receptor which is immobilized to a solid support. Typically, the non-mobilized component 
of the binding reaction, in this case either ligand (e.g., HSP) or the test compound, is labeled 
to enable detection. A variety of labeling methods are available and may be used, such as 
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luminescent, chromophore, fluorescent, or radioactive isotope or group containing same, and 
nonisotopic labels, such as enzymes or dyes. In a preferred embodiment, the test compound 
is labeled with a fluorophore such as fluorescein isothiocyanate (FITC, available from Sigma 
Chemicals, St. Louis). 

5 The labeled test compounds, or ligand (e.g., HSP) plus test compounds, are then 

allowed to contact with the solid support, under conditions that allow specific binding to 
occur. After the binding reaction has taken place, unbound and non-specifically bound test 
compounds are separated by means of washing the surface. Attachment of the binding 
partner to the solid phase can be accomplished in various ways known to those skilled in the 

jo art, including but not limited to chemical cross-linking, non-specific adhesion to a plastic 
surface, interaction with an antibody attached to the solid phase, interaction between a ligand 
attached to the binding partner (such as biotin) and a ligand-binding protein (such as avidin 
or streptavidin) attached to the solid phase, and so on. 

Finally, the label remaining on the solid surface may be detected by any detection 

1 5 method known in the art. For example, if the test compound is labeled with a fluorophore, a 
fluorimeter may be used to detect complexes. 

Preferably, the oc2M receptor is added to binding assays in the form of intact cells that 
express the a2M receptor, or isolated membranes containing the oc2M receptor. Thus, direct 
binding to the cc2M receptor or the ability of a test compound to modulate a ligand-a2M 

20 receptor complex (e.g., HSP- a2M receptor complex) may be assayed in intact cells in 
culture or in animal models in the presence and absence of the test compound. A labeled 
ligand (e.g., HSP) may be mixed with cells that express the a2M receptor, or to crude 
extracts obtained from such cells, and the test compound may be added. Isolated membranes 
may be used to identify compounds that interact with the cc2M receptor. For example, in a 

25 typical experiment using isolated membranes, cells may be genetically engineered to express 
the <x2M receptor. Membranes can be harvested by standard techniques and used in an in 
vitro binding assay. Labeled ligand (e.g., 125 I-labeled HSP) is bound to the membranes and 
assayed for specific activity; specific binding is determined by comparison with binding 
assays performed in the presence of excess unlabeled (cold) ligand. Alternatively, soluble 

30 oc2M receptor may be recombinantly expressed and utilized in non-cell based assays to 
identify compounds that bind to the a2M receptor. The recombinantly expressed a2M 
receptor polypeptides or fusion proteins containing the extracellular domain (ECD) of the 
oc2M receptor, or one or more subdomains thereof, can be used in the non-cell based 
screening assays. Alternatively, peptides corresponding to one or more of the CDs of the 

35 oc2M receptor, or fusion proteins containing one or more of the CDs of the a2M receptor can 
be used in non-cell based assay systems to identify compounds that bind to the cytoplasmic 
portion of the cc2M receptor; such compounds may be useful to modulate the signal 



-32- 



NY2- 1119469.2 



transduction pathway of the oc2M receptor. In non-cell based assays the recombinantly 
expressed the oc2M receptor is attached to a solid substrate such as a test tube, microtiter well 
or a column, by means well known to those in the art (see Ausubel et ah, supra). The test 
compounds are then assayed for their ability to bind to the a2M receptor. 

5 Alternatively, the binding reaction may be carried out in solution. In this assay, the 

labeled component is allowed to interact with its binding partner(s) in solution. If the size 
differences between the labeled component and its binding partner(s) permit such a 
separation, the separation can be achieved by passing the products of the binding reaction 
through an ultrafilter whose pores allow passage of unbound labeled component but not of its 

lO binding partner(s) or of labeled component bound to its partner(s). Separation can also be 
achieved using any reagent capable of capturing a binding partner of the labeled component 
from solution, such as an antibody against the binding partner, a ligand-binding protein 
which can interact with a ligand previously attached to the binding partner, and so on. 

In a one embodiment, for example, a phage library can be screened by passing phage 

1 5 from a continuous phage display library through a column containing purified a2M receptor, 
or derivative, analog, fragment, or domain, thereof, linked to a solid phase, such as plastic 
beads. By altering the stringency of the washing buffer, it is possible to enrich for phage that 
express peptides with high affinity for the a2M receptor. Phage isolated from the column 
can be cloned and the affinities of the short peptides can be measured directly. Sequences for 

20 more than one oligonucleotide can be combined to test for even higher affinity binding to the 
a2M receptor. Knowing which amino acid sequences confer the strongest binding to the 
a2M receptor, computer models can be used to identify the molecular contacts between the 
a2M receptor and the test compound. This will allow the design of non-protein compounds 
which mimic those contacts. Such a compound may have the same activity of the peptide 

25 and can be used therapeutically, having the advantage of being efficient and less costly to 
produce. 

In another specific embodiment of this aspect of the invention, the solid support is 
membranes containing the oc2M receptor attached to a microtiter dish. Test compounds, for 
example, cells that express library members are cultivated under conditions that allow 
30 expression of the library members in the microtiter dish. Library members that bind to the 
protein (or nucleic acid or derivative) are harvested. Such methods, are described by way of 
example in Parmley and Smith, 1988, Gene 73:305-318; Fowlkes et al, 1992, 
BioTechniques 13:422-427; PCT Publication No. WO 94/18318; and in references cited 
hereinabove. 
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In another embodiment of the present invention, interactions between the a2M 
receptor or ligand (e.g., HSP) and a test compound may be assayed in vitro. Known or 
unknown molecules are assayed for specific binding to the cc2M receptor nucleic acids, 
proteins, or derivatives under conditions conducive to binding, and then molecules that 

5 specifically bind to the oc2M receptor are identified. The two components can be measured 
in a variety of ways. One approach is to label one of the components with an easily 
detectable label, place it together with a test component(s) under conditions that allow 
binding to occur, perform a separation step which separates bound labeled component from 
unbound labeled component, and then measure the amount of bound component. In one 

IQ embodiment, the cc2M receptor can be labeled and added to a test agent, using conditions that 
allow binding to occur. Binding of the test agent can be determined using polyacrylamide 
gel analysis to compare complexes formed in the presence and absence of the test agent. 

In yet another embodiment, binding of ligand (e.g., HSP) to the a2M receptor may be 
assayed in intact cells in animal models. A labeled ligand (e.g., HSP) may be administered 

15 directly to an animal, with and without a test compound. Uptake of the ligand (e.g., HSP) 
may be measured in the presence and the absence of test compound. For these assays, host 
cells to which the test compound is added may be genetically engineered to express the a2M 
receptor and/or ligand (e.g., HSP), which may be transient, induced or constitutive, or stable. 
For the purposes of the screening methods of the present invention, a wide variety of host 

20 cells may be used including, but not limited to, tissue culture cells, mammalian cells, yeast 
cells, and bacteria. Mammalian cells such as macrophages or other cells that express the 
a2M receptor, i.e., cells of the monocytic lineage, liver parenchymal cells, fibroblasts, 
keratinocytes, neuronal cells, and placental syncytiotrophoblasts, may be a preferred cell type 
in which to carry out the assays of the present invention. Bacteria and yeast are relatively 

25 easy to cultivate but process proteins differently than mammalian cells. 

5.2.2 «2M RECEPTOR ACTIVITY ASSAYS 

After identification of a test compound that interacts with, or modulates the 
interaction of a ligand (e.g., HSP) with a2MR, the test compound can be further 

30 characterized to measure its effect on a2MR activity and the ligand-a2MR endocytic 
signaling pathway. For example, the test compound may be characterized by testing its 
effect on ligand (e.g., HSP) /a2MR cellular activity in vivo. Such assays include 
downstream signaling assays, antigen presentation assays, assays for antigen-specific 
activation of cytotoxic T cells, and the like. 

35 In various embodiments, a candidate compound identified in a primary assay may be 

tested for its effect on innate a2MR signaling activity. For example, downstream signaling 
effects of ct2M receptor activation which can be assayed include, but are not limited to: 
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enhanced locomotion and chemotaxis of macrophages (Forrester et al, 1983, Immunology 
50: 251-259), down regulation of proteinase synthesis, and elevation of intracellular calcium, 
inositol phosphates and cyclic AMP (Misra et al, 1993, Biochem. J., 290:885-891). Other 
innate immune responses that can be tested are release of cytokines {i.e., IL-12, IL1|3, 

5 GMCSF, and TNFcc). Thus, as secondary assays, any identified candidate compound can be 
tested for changes in such activities in the presence and absence. 

For example, in one embodiment, a chemotaxis assay can be used to further 
characterize a candidate identified by a primary screening assay. It is known that oc2M 
modified by protease interaction can induce directional migration of cells towards their 

10 ligand. A number of techniques can be used to test chemotactic migration in vitro (see, e.g., 
Leonard et al, 1995, "Measurement of a and B Chemokines", in Current Protocols in 
Immunology, 6.12.1-6.12.28, Ed. Coligan et al, John Wiley & Sons, Inc. 1995). For 
example, in one embodiment, a candidate compound can be tested for its ability to modulate 
the ability of cc2MR to induce migration of cells that express the receptor using a chemokine 

15 gradient in a multiwell Boy den chemotaxis chamber. In a specific example of this method, a 
serial dilution of a ligand (e.g., an HSP) / a2MR antagonist or agonist test compound 
identified in the primary screen is placed in the bottom wells of the Boyden chemotaxis 
chamber. A constant amount of ligand is also added to the dilution series. As a control, at 
least one aliquot contains only ligand (e.g., HSP). The contribution of the antagonist or 

20 agonist compound to the chemotactic activity of oc2MR is measured by comparing number of 
migrating cells on the lower surface of the membrane filter of the aliquots containing only 
ligand (e.g., HSP), with the number of cells in aliquots containing test compound and ligand 
(e.g., HSP). If addition of the test compound to the ligand (e.g., HSP) solution results in a 
decrease in the number of cells detected the membrane relative to the number of cells 

25 detected using a solution containing only ligand (e.g., HSP), then an antagonist of ligand 
(e.g., HSP) induction of chemotactic activity of a2MR-expressing cells is identified. 

Elevation in intracellular ionized calcium concentration ([Ca 2+ ] ; ) is also an indicator 
of a2MR activation (Misra et al, 1993, supra). Thus, in another embodiment, calcium flux 
assays can be used as secondary screens to further characterize modulators of ligand-a2MR 

30 interactions. Intracellular calcium ion concentration can be measured in cells that express the 
a2M receptor in the presence of the ligand, in the presence and the absence of a test 
compound. For example, calcium mobilization can be detected and measured by flow 
cytometry, by labeling with fluorescent dyes that are trapped intracellularly A fluorescent 
dye such as Indo-1 exhibits a change in emission spectrum upon binding calcium, the ratio of 

35 fluorescence produced by the calcium-bound dye to that produce by the unbound dye may be 
used to estimate the intracellular calcium concentration. In a specific embodiment, cells are 
incubated in a cuvette in media containing Indo-1 at 37°C and are excited, and fluorescence 
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is measured using a fluorimeter (Photon Technology Corporation, International). The ligand 
is added at a specific time point, in the presence and the absence of a test compound, EGTA 
is added to the cuvette to release and chelate total calcium, and the response is measured. 
Binding of ligand results in increased intracellular Ca 2+ concentration in cells that express 

5 ot2MR. An agonist results in a relative increased intracellular Ca 2+ concentration, whereas an 
antagonist results in a relative decreased intracellular Ca 2+ concentration 

In other embodiments, antigen-specific response assays may be used to detect the 
effect of a candidate compound on presentation of antigenic molecule by an a2MR ligand, 
for example an HSP or HSP complex. For example, an antigen presentation assay may be 

10 performed to determine the effect of a compound in vivo on the uptake of complexes capable 
of interacting with the cc2M receptor, e.g., HSP-antigenic molecule complexes, by cells 
expressing the oc2M receptor. Such re-presentation assays are known in the art, and have 
been described previously (Suto and Srivastava, 1995, Science 269:1585-1588). For 
example, in one embodiment, antigen presenting cells, such as a macrophage cell line {e.g., 

15 RAW264.7), are mixed with antigen-specific T cells in media, using approximately 10,000 
cells of each type at approximately a 1:1 ratio. Complexes of HSP (10 ug/ml) and a peptide 
antigen, as well as test compound, is added to the cells and the culture is incubated for 
approximately 20 hours. Stimulation of T cells may then be measured in the presence and 
absence of test compound. 

20 In another embodiment, antigen-specific T cell stimulation may be assayed. In one 

embodiment an IFN-y release assay may be used. After washing, cells are fixed, 
permeabilized, and reacted with dye-labeled antibodies reactive with human IFN-y (PE- anti- 
IFN-y). Samples are analyzed by flow cytometry using standard techniques. Alternatively, a 
filter immunoassay, ELISA (enzyme linked immunosorbent assay), or enzyme-linked 

25 immunospot assay (ELISPOT) assay, may be used to detect specific cytokines produced by 
an activated T cell. In one embodiment, for example, a nitrocellulose-backed microtiter plate 
is coated with a purified cytokine-specific primary antibody, i.e., anti-IFN-y, and the plate is 
blocked to avoid background due to nonspecific binding of other proteins. A sample of APC 
cells stimulated with antigen is diluted onto the wells of the microtiter plate. A labeled, e.g., 

30 biotin-labeled, secondary anti-cytokine antibody is added. The antibody cytokine complex 
can then be detected, i.e., by enzyme-conjugated streptavidin - cytokine-secreting cells will 
appear as "spots" by visual, microscopic, or electronic detection methods. In another 
embodiment, "tetramer staining" assay (Altaian et al, 1996, Science 274: 94-96) maybe 
used to identify antigen- specific T-cells. For example, an MHC molecule containing a 

35 specific peptide antigen, such as a tumor-specific antigen, is multimerized to make soluble 
peptide tetramers and labeled, for example, by complexing to streptavidin. The MHC- 
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peptide antigen complex is then mixed with a population of stimulated T cells. Biotin is then 
used to stain T cells which recognize and bind to the MHC-antigen complex. 



5.2.3 COMPOUNDS THAT CAN BE SCREENED IN ACCORDANCE WITH 
THE INVENTION 

The screening assays described herein may be used to identify small molecules, 
peptides or proteins, or derivatives, analogs and fragments thereof, that interact with, or 
modulate the interaction of a ligand (e.g., HSP) with the a2M receptor. The compounds 
which may be screened in accordance with the invention include, but are not limited to small 
molecules, peptides, antibodies and fragments thereof, and other organic compounds (e.g., 
peptidomimetics) that bind to the ECD of the a2M receptor and either inhibit the activity 
triggered by the natural ligand (i.e., antagonists) or mimic the activity triggered by the 
natural ligand (i.e., agonists), as well as small molecules, peptides, antibodies or fragments 
thereof, and other organic compounds. In one embodiment, such compounds include 
sequences of the a2M receptor, such as the ECD of the cc2M receptor (or a portion thereof), 
which can bind to and "neutralize" natural ligands, such as HSPs, a2M, LDL, etc. In 
another embodiment, such compounds include ligand sequences, such as HSP sequences 
and/or a2M sequences, which can bind to the active site of the cc2M receptor, and block its 
activity. 

Compounds that may be used for screening include, but are not limited to, peptides 
such as, for example, soluble peptides, including but not limited to members of random 
peptide libraries; (see, e.g., Lam et al, 1991, Nature 354:82-84; Houghten et al, 1991, 
Nature 354:84-86), and combinatorial chemistry-derived molecular library made of D- and/or 
L- configuration amino acids, phosphopeptides (including, but not limited to, members of 
random or partially degenerate, directed phosphopeptide libraries; see, e.g., Songyang et al, 
1993, Cell 72:767-778), antibodies (including, but not limited to, polyclonal, monoclonal, 
humanized, anti-idiotypic, chimeric or single chain antibodies, and FAb, F(ab') 2 and FAb 
expression library fragments, and epitope-binding fragments thereof), and small organic or 
inorganic molecules. 

In one embodiment of the present invention, peptide libraries may be used as a source 
of test compounds that can be used to screen for modulators of cc2MR interactions, such as 
HSP- a2M receptor. Diversity libraries, such as random or combinatorial peptide or 
nonpeptide libraries can be screened for molecules that specifically bind to the cc2M receptor. 
Many libraries are known in the art that can be used, e.g., chemically synthesized libraries, 
recombinant (e.g., phage display libraries), and in vitro translation-based libraries. 

Examples of chemically synthesized libraries are described in Fodor et al, 1991, 
Science 251:767-773; Houghten et al, 1991, Nature 354:84-86; Lam et al, 1991, 
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Nature 354:82-84; Medynski, 1994, Bio/Technology 12:709-710; Gallop et al, 1994, J. 
Medicinal Chemistry 37(9): 1233-1251; Ohlmeyer etal, 1993, Proc. Natl. Acad. Sci. USA 
90:10922-10926; Erb et al, 1994, Proc. Natl. Acad. Sci. USA 91:11422-11426; Houghten et 
al, 1992, Biotechniques 13:412; Jayawickreme et al, 1994, Proc. Natl. Acad. Sci. USA 
5 91:1614-1618; Salmon etal., 1993, Proc. Natl. Acad. Sci. USA 90:11708-11712; PCT 
Publication No. WO 93/20242; and Brenner and Lerner, 1992, Proc. Natl. Acad. Sci. USA 
89:5381-5383. 

Examples of phage display libraries are described in Scott & Smith, 1990, 
Science 249:386-390; Devlin et al, 1990, Science, 249:404-406; Christian et al, 1992, J. 

10 Mol. Biol. 227:711-718; Lenstra, 1992, J. Immunol. Meth. 152:149-157; Kay etal, 1993, 
Gene 128:59-65; and PCT Publication No. WO 94/18318 dated August 18, 1994. 

By way of examples of nonpeptide libraries, a benzodiazepine library {see e.g., Bunin 
et al, 1994, Proc. Natl. Acad. Sci. USA 91:4708-4712) can be adapted for use. Peptoid 
libraries (Simon et al, 1992, Proc. Natl. Acad. Sci. USA 89:9367-9371) can also be used. 

j 5 Another example of a library that can be used, in which the amide functionalities in peptides 
have been permethylated to generate a chemically transformed combinatorial library, is 
described by Ostresh et al (1994, Proc. Natl. Acad. Sci. USA 91 :1 1 138-1 1 142). 

Screening the libraries can be accomplished by any of a variety of commonly known 
methods. See, e.g., the following references, which disclose screening of peptide libraries: 

20 Parmley & Smith, 1989, Adv. Exp. Med. Biol. 251:215-218; Scott & Smith, 1990, 

Science 249:386-390; Fowlkes et al, 1992; BioTechniques 13:422-427; Oldenburg et al, 
1992, Proc. Natl. Acad. Sci. USA 89:5393-5397; Yu et al, 1994, Cell 76:933-945; Staudt et 
al, 1988, Science 241:577-580; Bock etal, 1992, Nature 355:564-566; Tuerk et al, 1992, 
Proc. Natl. Acad. Sci. USA 89:6988-6992; Ellington et al, 1992, Nature 355:850-852; U.S. 

25 Patent No. 5,096,815, U.S. Patent No. 5,223,409, and U.S. Patent No. 5,198,346, all to 
Ladner et al; Rebar & Pabo, 1993, Science 263:671-673; and PCT Publication No. 
WO 94/18318. 

In another embodiment of the present invention, the screening may be performed by 
adding the labeled ligand (e.g., HSP) to in vitro translation systems such as a rabbit 
30 reticulocyte lysate (RRL) system and then proceeding with in vitro priming reaction. In vitro 
translation-based libraries include but are not limited to those described in PCT Publication 
No. WO 91/05058 dated April 18, 1991; and Mattheakis et al, 1994, Proc. Natl. Acad. Sci. 
USA 91:9022-9026. 

Compounds that can be tested and identified methods described herein can include, 
35 but are not limited to, compounds obtained from any commercial source, including Aldrich 
(Milwaukee, WI 53233), Sigma Chemical (St. Louis, MO), Fluka Chemie AG (Buchs, 
Switzerland) Fluka Chemical Corp. (Ronkonkoma, NY;), Eastman Chemical Company, Fine 
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Chemicals (Kingsport, TN), Boehringer Mannheim GmbH (Mannheim, Germany), Takasago 
(Rockleigh, NJ), SST Corporation (Clifton, NJ), Ferro (Zachary, LA 70791), Riedel-deHaen 
Aktiengesellschaft (Seelze, Germany), PPG Industries Inc., Fine Chemicals (Pittsburgh, PA 
15272). Further any kind of natural products may be screened using the methods of the 

5 invention, including microbial, fungal, plant or animal extracts. 

Furthermore, diversity libraries of test compounds, including small molecule test 
compounds, may be utilized. For example, libraries may be commercially obtained from 
Specs and BioSpecs B.V. (Rijswijk, The Netherlands), Chembridge Corporation (San Diego, 
CA), Contract Service Company (Dolgoprudny, Moscow Region, Russia), Comgenex USA 

1 q Inc. (Princeton, NJ), Maybridge Chemicals Ltd. (Cornwall PL34 OFfW, United Kingdom), 
and Asinex (Moscow, Russia). 

Still further, combinatorial library methods known in the art, can be utilize, including, 
but not limited to: biological libraries; spatially addressable parallel solid phase or solution 
phase libraries; synthetic library methods requiring deconvolution; the "one-bead 

15 one-compound" library method; and synthetic library methods using affinity chromatography 
selection. The biological library approach is limited to peptide libraries, while the other four 
approaches are applicable to peptide, non-peptide oligomer or small molecule libraries of 
compounds (Lam,1997, Anticancer Drug Des. 12:145). Combinatorial libraries of test 
compounds, including small molecule test compounds, can be utilized, and may, for 

20 example, be generated as disclosed in Eichler & Houghten, 1995, Mol. Med. Today 1:174- 
180; Dolle, 1997, Mol. Divers. 2:223-236; and Lam, 1997, Anticancer Drug Des. 12:145- 
167. 

Examples of methods for the synthesis of molecular libraries can be found in the art, 
for example in: DeWitt et al, 1993, Proc. Natl. Acad. Sci. USA 90:6909; Erb et al, 1994, 
25 Proc. Natl. Acad. Sci. USA 91:11422; Zuckermann et al, 1994, J. Med. Chem. 37:2678; Cho 
et al, 1993, Science 261:1303; Carrell et al, 1994, Angew. Chem. Int. Ed. Engl. 33:2059; 
Carell et al, 1994, Angew. Chem. Int. Ed. Engl. 33:2061; and Gallop et al, 1994, J. Med. 
Chem. 37:1233. 

Libraries of compounds may be presented in solution {e.g., Houghten, 1992, 
30 BioTechniques 13:412-421), or on beads (Lam, 1991, Nature 354:82-84), chips (Fodor, 
1993, Nature 364:555-556), bacteria (U.S. Patent No. 5,223,409), spores (Patent Nos. 
5,571,698; 5,403,484; and 5,223,409), plasmids (Cull et al, 1992, Proc. Natl. Acad. Sci. 
USA 89:1865-1869) or phage (Scott and Smith, 1990, Science 249:386-390; Devlin, 1990, 
Science 249:404-406; Cwirla et al, 1990, Proc. Natl. Acad. Sci. USA 87:6378-6382; and 
35 Felici, 1991, J. Mol. Biol. 222:301-310). 

Screening the libraries can be accomplished by any of a variety of commonly known 
methods. See, e.g., the following references, which disclose screening of peptide libraries: 



-39- 



NY2- 1119469.2 



Parmley & Smith, 1989, Adv. Exp. Med. Biol. 251:215-218; Scott & Smith, 1990, 
Science 249:386-390; Fowlkes et al, 1992; BioTechniques 13:422-427; Oldenburg et al, 
1992, Proc. Natl. Acad. Sci. USA 89:5393-5397; Yu et al, 1994, Cell 76:933-945; Staudt et 
al, 1988, Science 241:577-580; Bock et al, 1992, Nature 355:564-566; Tuerk et al, 1992, 
5 Proc. Natl. Acad. Sci. USA 89:6988-6992; Ellington et al, 1992, Nature 355:850-852; U.S. 
Patent No. 5,096,815, U.S. Patent No. 5,223,409, and U.S. Patent No. 5,198,346, all to 
Ladner et al; Rebar & Pabo, 1993, Science 263:671-673; and PCT Publication No. 
WO 94/18318. 



5.3 IDENTIFICATION OF FRAGMENTS OF THE <x2M RECEPTOR AND/OR a2M 
RECEPTOR LIGANDS, SUCH AS HSPS, USEFUL FOR IMMUNOTHERAPY 

The invention also encompasses methods for identifying ligand-binding cc2MR 
fragments (such as "HSP-binding domains"), and analogs, muteins, or derivatives thereof, 
which are capable of binding to, and uptake of, a2MR ligand-antigenic peptide, such as 
HSP-antigenic peptide complexes. Such ligand-binding a2MR fragment, e.g., HSP-binding 
domains, can then be tested for activity in vivo and in vitro using the a2M receptor/ligand 
binding assays, described in Section 5.2.1, above. In one embodiment, such a method for 
identifying an a2MR fragment capable of binding a heat shock protein comprises the steps 
of: (a) contacting a heat shock protein with one or more cc2MR fragments; and (b) 
identifying an a2MR polypeptide fragment which specifically binds to the heat shock 
protein. 

Ligand-binding domains, e.g., HSP-binding domains, of the oc2MR capable of 
binding ligand-antigenic peptide complexes, such as HSP-antigenic peptide complexes, and 
can be further tested for activity using either in vivo binding assays, re-presentation assays, or 
CTL assays, such as those described in Section 5.2.2, above. For example, one such method 
for identifying an oc2MR fragment capable of inducing an HSP-a2M receptor-mediated 
process comprises the steps of: (a) contacting a heat shock protein with cell expressing 
a2MR fragment; and (b) measuring the level of oc2MR activity in the cell, such that if the 
level of the HSP-oc2M receptor-mediated process or activity measured in (b) is greater than 
the level of os2MR activity in the absence of the oc2MR fragment, then an a2MR fragment 
capable of inducing an HSP-a2M receptor-mediated process is identified. Depending on 
their behavior in such assays, such molecules can be used to either enhance or, alternatively, 
block the function of the receptor when administered or expressed in vivo. For example, 
these assays can be used to identify a2MR HSP-binding domains which can bind HSP- 
antigen complexes and negatively interfere with their uptake by antigen presenting cells. 
These antagonists could be used to downregulate immune responses which are caused by 
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cellular release of HSPs. Alternatively, certain a2MR HSP-binding domains may be used to 
enhance HSP-antigen complex uptake and signaling. Such agonists could be administered or 
expressed in subjects to elicit an immune response against an antigen of interest. 

In another embodiment, the invention encompasses methods for identifying ligand 

5 fragment, such as HSP fragments, which are capable of binding and being taken up by the 
a2M receptor ("a2M receptor-binding domains"), and analogs, muteins, or derivatives 
thereof. As described for assays for a2M receptor-related polypeptides described above, 
such cc2M receptor-binding domains can then be tested for activity in vivo and in vitro using 
the binding assays described in Section 5.2.1, above. For example, one such method for 

1 q identifying a heat shock protein fragment capable of binding an a2M receptor comprises: (a) 
contacting an a2M receptor with one or more heat shock protein fragments; and (b) 
identifying a heat shock protein fragment which specifically binds to the oc2M receptor. 

Ligand fragments, such as HSP fragments, of interest may be further tested in cells, 
using in vivo binding assays, re-presentation assays, or CTL assays, such as those described 

25 in Section 5.2.2, above. For example, in one embodiment, such a method for identifying a 
heat shock protein fragment capable of inducing an HSP-cc2M receptor-mediated process 
comprises: a) contacting an a2M receptor fragment with a cell expressing a heat shock 
protein; and b) measuring the level of a2MR activity in the cell, such that if the level of the 
HSP-a2M receptor-mediated process or activity measured in (b) is greater than the level of 

20 cx2MR activity in the absence of said heat shock protein fragment. Alternatively, a2M 

receptor-binding domains which decrease uptake of HSPs could be used to block HSP uptake 
by the a2M receptor. In one embodiment, such HSP fragments comprising oc2M receptor- 
binding domain sequences could be used to construct recombinant fusion proteins, comprised 
of a heat shock protein oc2M receptor-binding domain and an antigenic peptide sequence. 

25 Such recombinant fusion proteins may be used to elicit an immune response and to treat or 
prevent immune diseases and disorders (Suzue et al, 1997, Proc. Natl. Acad. Sci. U.S.A. 94 : 
13146-51). 

The oc2M receptor fragments, analogs, muteins, and derivatives and/or ligand (e.g., 
HSP) fragments, analogs, muteins, and derivatives of the invention may be produced by 
30 recombinant DNA techniques, synthetic methods, or by enzymatic or chemical cleavage of 
native a2M receptor and/or ligands (e.g., HSPs). 

Any eukaryotic cell may serve as the nucleic acid source for obtaining the coding 
region of an a2M receptor or ct2M receptor ligand (e.g., HSP) gene. Nucleic acid sequences 
encoding ligand, e.g., HSPs, and or the a2M receptor can be isolated from vertebrate, 
35 mammalian, as well as primate sources, including humans. Amino acid sequences and 
nucleotide sequences of naturally occurring ligands, e.g., HSPs, and cc2M receptor are 
generally available in sequence databases, such as Genbank. 
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The DNA may be obtained by standard procedures known in the art by DNA 
amplification or molecular cloning directly from a tissue, cell culture, or cloned DNA (e.g., a 
DNA "library"). Clones derived from genomic DNA may contain regulatory and intron 
DNA regions in addition to coding regions; clones derived from cDNA will contain only 

5 exon sequences. In a preferred embodiment, DNA can be amplified from genomic or cDNA 
by polymerase chain reaction (PCR) amplification using primers designed from the known 
sequence of an a2M receptor ligand, e.g., HSP, cc2M, or other a2MR ligand. The 
polymerase chain reaction (PCR) is commonly used for obtaining genes or gene fragments of 
interest. For example, a nucleotide sequence encoding a fragment of any desired length can 

jq be generated using PCR primers that flank the nucleotide sequence encoding the peptide- 
binding domain. Alternatively, an a2MR ligand, e.g., HSP, a2M, or other a2MR ligand 
receptor gene sequence can be cleaved at appropriate sites with restriction endonuclease(s) if 
such sites are available, releasing a fragment of DNA encoding the peptide-binding domain. 
If convenient restriction sites are not available, they may be created in the appropriate 

1 5 positions by site-directed mutagenesis and/or DNA amplification methods known in the art 
(see, for example, Shankarappa et al, 1992, PCR Method Appl. 1:277-278). The DNA 
fragment that encodes a fragment of the ligand (e.g., HSP) or oc2M receptor gene is then 
isolated, and ligated into an appropriate expression vector, care being taken to ensure that the 
proper translation reading frame is maintained. Alternatives to isolating the genomic DNA 

20 include, but are not limited to, chemically synthesizing the gene sequence itself from a 

known sequence or making cDNA to the mRNA which encodes the ligand (e.g., HSP) and/or 
a2M receptor. 

Any technique for mutagenesis known in the art can be used to modify individual 
nucleotides in a DNA sequence, for purpose of making amino acid substitution(s) in the 

25 expressed peptide sequence, or for creating/deleting restriction sites to facilitate further 
manipulations. Such techniques include but are not limited to, chemical mutagenesis, in 
vitro site-directed mutagenesis (Hutchinson, C, et al, 1978, J. Biol. Chem 253:6551), 
oligonucleotide-directed mutagenesis (Smith, 1985, Ann. Rev. Genet. 19:423-463; Hill et al, 
1987, Methods Enzymol. 155:558-568), PCR-based overlap extension (Ho et al, 1989, Gene 

30 77:51-59), PCR-based megaprimer mutagenesis (Sarkar et al, 1990, Biotechniques, 8:404- 
407), etc. Modifications can be confirmed by double stranded dideoxy DNA sequencing. 

An alternative to producing a2M receptor and/or ligand (e.g., HSP) fragments by 
recombinant techniques is peptide synthesis. For example, a peptide corresponding to a 
portion of an a2M receptor and/or ligand (e.g., HSP) comprising the substrate-binding 

35 domain, or which binds peptides in vitro, can be synthesized by use of a peptide synthesizer. 
Conventional peptide synthesis may be used or other synthetic protocols well known in the 
art. 



-42- 



NY2- 1119469.2 



In addition, analogs and derivatives of a2M receptor and/or ligand (e.g., HSP) can be 
chemically synthesized. Furthermore, if desired, nonclassical amino acids or chemical 
amino acid analogs can be introduced as a substitution or addition into the oc2M receptor 
and/or ligand (e.g., HSP) sequence. Non-classical amino acids include but are not limited to 

5 the D-isomers of the common amino acids, a-amino isobutyric acid, 4-aminobutyric acid, 
Abu, 2-amino butyric acid, y-Abu, e-Ahx, 6-amino hexanoic acid, Aib, 2-amino isobutyric 
acid, 3-amino propionic acid, ornithine, norleucine, norvaline, hydroxyproline, sarcosine, 
citrulline, cysteic acid, t-butylglycine, t-butylalanine, phenylglycine, cyclohexylalanine, P- 
alanine, fluoro-amino acids, designer amino acids such as P-methyl amino acids, Ca-methyl 

! q amino acids, Na-methyl amino acids, and amino acid analogs in general. 

a2M receptor and/or ligand (e.g., HSP) peptides, or a mutant or derivative thereof, 
may be synthesized by solid-phase peptide synthesis using procedures similar to those 
described by Merrifield, 1963, J. Am. Chem. Soc, 85:2149. During synthesis, N-cc-protected 
amino acids having protected side chains are added stepwise to a growing polypeptide chain 

j 5 linked by its C-terminal and to an insoluble polymeric support i.e., polystyrene beads. The 
peptides are synthesized by linking an amino group of an N-a-deprotected amino acid to an 
a-carboxyl group of an N-cc-protected amino acid that has been activated by reacting it with a 
reagent such as dicyclohexylcarbodiimide. The attachment of a free amino group to the 
activated carboxyl leads to peptide bond formation. The most commonly used N-oc- 

20 protecting groups include Boc which is acid labile and Fmoc which is base labile. Details of 
appropriate chemistries, resins, protecting groups, protected amino acids and reagents are 
well known in the art and so are not discussed in detail herein (See, Atherton, et ah, 1989, 
Solid Phase Peptide Synthesis: A Practical Approach, IRL Press, and Bodanszky, 1993, 
Peptide Chemistry, A Practical Textbook, 2nd Ed., Springer- Verlag). 

25 Purification of the resulting fragment is accomplished using conventional procedures, 

such as preparative HPLC using gel permeation, partition and/or ion exchange 
chromatography. The choice of appropriate matrices and buffers are well known in the art 
and so are not described in detail herein. 

In an alternative embodiment, fragments of an a2M receptor and/or ligand (e.g., 

30 HSP) may be obtained by chemical or enzymatic cleavage of native or recombinant a2M 
receptor and/or ligand (e.g., HSP) molecules. Specific chemical cleavage can be performed 
by cyanogen bromide, NaBH 4 , acetylation, formylation, oxidation, reduction, metabolic 
synthesis in the presence of tunicamycin, etc.. Endoproteases that cleave at specific sites 
can also be used. Such proteases are known in the art, including, but not limited to, trypsin, 

35 a-chymotrypsin, V8 protease, papain, and proteinase K (see Ausubel et ah, (eds.), in 
"Current Protocols in Molecular Biology", Greene Publishing Associates and Wiley 
Interscience, New York, 17.4.6-17.4.8). The a2M receptor and/or ligand (e.g., HSP) amino 
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acid sequence of interest can be examined for the recognition sites of these proteases. An 
enzyme is chosen which can release a pep tide-binding domain or peptide-binding fragment. 
The a2M receptor and/or ligand (e.g., HSP) molecule is then incubated with the protease, 
under conditions that allow digestion by the protease and release of the specifically 

5 designated peptide-binding fragments. Alternatively, such protease digestions can be carried 
out blindly, i.e., not knowing which digestion product will contain the peptide-binding 
domain, using specific or general specificity proteases, such as proteinase K or pronase. 

Once a fragment is prepared, the digestion products may be purified as described 
above, and subsequently tested for the ability to bind peptide or for immunogenicity. 

j 0 Methods for determining the immunogenicity of a2M receptor ligand (e.g. , HSP) complexes 
by cytotoxicity tests are described in Section 5.2.2. 

5.4 DRUG DESIGN 

Upon identification of a compound that interacts with a2MR, or modulates the 
15 interaction of an a2M receptor ligand, such as an HSP, with the a2M receptor, such a 

compound can be further investigated to test for an ability to alter the immune response. In 
particular, for example, the compounds identified via the present methods can be further 
tested in vivo in accepted animal models of HSP-a2MR-mediated processes and HSP- a2MR 
related disorders, such as, e.g., immune disorders, proliferative disorders, and infectious 
20 diseases. 

Computer modeling and searching technologies permit identification of compounds, 
or the improvement of already identified compounds, which can modulate the interaction of 
the a2M receptor with its ligand, e.g., an HSP. Having identified such a compound or 
composition, the active sites or regions are identified. Such active sites might typically be 

25 ligand binding sites. The active site can be identified using methods known in the art 
including, for example, from the amino acid sequences of peptides, from the nucleotide 
sequences of nucleic acids, or from study of complexes of the relevant compound or 
composition with its natural ligand. In the latter case, chemical or X-ray crystallographic 
methods can be used to find the active site by finding where on the factor the complexed 

30 ligand is found. 

Next, the three dimensional geometric structure of the active site is determined. This 
can be done by known methods, including X-ray crystallography, which can determine a 
complete molecular structure. On the other hand, solid or liquid phase NMR can be used to 
determine certain intra-molecular distances. Any other experimental method of structure 
35 determination can be used to obtain partial or complete geometric structures. The geometric 
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structures may be measured with a complexed ligand, natural or artificial, which may 
increase the accuracy of the active site structure determined. 

If an incomplete or insufficiently accurate structure is determined, the methods of 
computer based numerical modeling can be used to complete the structure or improve its 

5 accuracy. Any recognized modeling method may be used, including parameterized models 
specific to particular biopolymers such as proteins or nucleic acids, molecular dynamics 
models based on computing molecular motions, statistical mechanics models based on 
thermal ensembles, or combined models. For most types of models, standard molecular 
force fields, representing the forces between constituent atoms and groups, are necessary, and 

1Q can be selected from force fields known in physical chemistry. The incomplete or less 
accurate experimental structures can serve as constraints on the complete and more accurate 
structures computed by these modeling methods. 

Finally, having determined the structure of the active site, either experimentally, by 
modeling, or by a combination, candidate modulating compounds can be identified by 

j 5 searching databases containing compounds along with information on their molecular 

structure. Such a search seeks compounds having structures that match the determined active 
site structure and that interact with the groups defining the active site. Such a search can be 
manual, but is preferably computer assisted. These compounds found from this search are 
potential the cc2M receptor-modulating compounds. 

20 Alternatively, these methods can be used to identify improved modulating 

compounds from an already known modulating compound or ligand. The composition of the 
known compound can be modified and the structural effects of modification can be 
determined using the experimental and computer modeling methods described above applied 
to the new composition. The altered structure is then compared to the active site structure of 

25 the compound to determine if an improved fit or interaction results. In this manner 
systematic variations in composition, such as by varying side groups, can be quickly 
evaluated to obtain modified modulating compounds or ligands of improved specificity or 
activity. 

Further experimental and computer modeling methods useful to identify modulating 
30 compounds based upon identification of the active sites of either the ct2M receptor or the 
HSP, and other a2M receptor ligands and their analogs, will be apparent to those of skill in 
the art. 

Examples of molecular modeling systems are the CHARMm and QUANTA 
programs (Polygen Corporation, Waltham, MA). CHARMm performs the energy 
35 minimization and molecular dynamics functions. QUANTA performs the construction, 
graphic modelling and analysis of molecular structure. QUANTA allows interactive 
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construction, modification, visualization, and analysis of the behavior of molecules with each 
other. 

A number of articles review computer modeling of drugs interactive with specific 
proteins, such as Rotivinen et al.) 1988, Acta Pharmaceutical Fennica 97:159-166); Ripka 

5 (1988 New Scientist 54-57); McKinaly and Rossmann (1989, Annu. Rev. Pharmacol. 
Toxiciol. 29:111-122); Perry andDavies, OSAR: Quantitative Structure-Activity 
Relationships in Drug Design pp. 189-193 Alan R. Liss, Inc. 1989; Lewis and Dean (1989, 
Proc. R. Soc. Lond. 236:125-140 and 141-162); and, with respect to a model receptor for 
nucleic acid components, Askew et al. (1989, J. Am. Chem. Soc. 1 1 1 :1082-1090). Other 

iq computer programs that screen and graphically depict chemicals are available from 

companies such as BioDesign, Inc. (Pasadena, CA.), Allelix, Inc. (Mississauga, Ontario, 
Canada), and Hypercube, Inc. (Cambridge, Ontario). Although these are primarily designed 
for application to drugs specific to particular proteins, they can be adapted to design of drugs 
specific to regions of DNA or RNA, once that region is identified. 

15 

5.5 DIAGNOSTIC USES 

The cc2M receptor is a cell surface protein present on many tissues and cell types 
(Herz et al, 1988, EMBO J. 7:41 19-27; Moestrup et al, 1992, Cell Tissue Res. 269: 375- 
82), that appears to be involved in the specific uptake and re-presentation of a2M receptor 

20 ligands, such as HSPs and HSP- peptide complexes. The cc2M receptor was initially 
identified as a heat shock protein receptor due to its interaction with gp96, which is 
exclusively intracellular and is released as a result of necrotic but not apoptotic cell death. 
Thus, gp96 uptake by the cc2M receptor may act as a sensor of necrotic cell death. As such, 
a2M receptor-ligand complexes may be used to detect and diagnose proliferative disorders, 

25 such as cancer, autoimmune disorders and infectious disease. Therefore, oc2M receptor 
proteins, analogues, derivatives, and subsequences thereof, a2M receptor nucleic acids (and 
sequences complementary thereto), and anti-a2M receptor antibodies, have uses in detecting 
and diagnosing such disorders. 

The <x2M receptor and a2M receptor nucleic acids can be used in assays to detect, 

30 prognose, or diagnose immune system disorders that may result in tumorigenesis, 
carcinomas, adenomas etc, and viral disease. 

The molecules of the present invention can be used in assays, such as immunoassays, 
to detect, prognose, diagnose, or monitor various conditions, diseases, and disorders affecting 
a2M receptor expression, or monitor the treatment thereof. In particular, such an 

35 immunoassay is carried out by a method comprising contacting a sample derived from a 
patient with an HSP-a2M receptor specific antibody under conditions such that 
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immunospecific binding can occur, and detecting or measuring the amount of any 
immunospecific binding by the antibody. In a specific aspect, such binding of antibody, in 
tissue sections, can be used to detect aberrant a2M receptor localization or aberrant (e.g. , low 
or absent) levels of cc2M receptor. In a specific embodiment, antibody to the oc2M receptor 

5 can be used to assay a patient tissue or serum sample for the presence of the a2M receptor 
where an aberrant level of a2M receptor is an indication of a diseased condition. By 
"aberrant levels," is meant increased or decreased levels relative to that present, or a standard 
level representing that present, in an analogous sample from a portion of the body or from a 
subject not having the disorder. 

10 The immunoassays which can be used include but are not limited to competitive and 

non-competitive assay systems using techniques such as western blots, immunohisto- 
chemistry radioimmunoassays, ELISA, "sandwich" immunoassays, immunoprecipitation 
assays, precipitin reactions, gel diffusion precipitin reactions, immunodiffusion assays, 
agglutination assays, complement-fixation assays, immunoradiometric assays, fluorescent 

Y 5 immunoassays, protein A immunoassays, to name but a few. 

cc2M receptor genes and related nucleic acid sequences and subsequences, including 
complementary sequences, can also be used in hybridization assays. oc2M receptor nucleic 
acid sequences, or subsequences thereof, comprising about at least 8 nucleotides, can be used 
as hybridization probes. Hybridization assays can be used to detect, prognose, diagnose, or 

20 monitor conditions, disorders, or disease states associated with aberrant changes in a2M 
receptor expression and/or activity as described supra. In particular, such a hybridization 
assay is carried out by a method comprising contacting a sample containing nucleic acid with 
a nucleic acid probe capable of hybridizing to a2M receptor DNA or RNA, under conditions 
such that hybridization can occur, and detecting or measuring any resulting hybridization. 

25 In specific embodiments, diseases and disorders involving decreased immune 

responsiveness during an infection or malignant disorder can be diagnosed, or their suspected 
presence can be screened for, or a predisposition to develop such disorders can be detected, 
by detecting decreased levels of cc2M receptor protein, oc2M receptor RNA, or the a2M 
receptor functional activity (e.g., binding to HSP, antibody-binding activity etc.), or by 

30 detecting mutations in oc2M receptor RNA, DNA or oc2M receptor protein (e.g., 

translocations in the a2M receptor nucleic acids, truncations in the cc2M receptor gene or 
protein, changes in nucleotide or amino acid sequence relative to wild-type <x2M receptor) 
that cause decreased expression or activity of a2M receptor. Such diseases and disorders 
include but are not limited to those described in Sections 5.7, 5.8, and 5.9. By way of 

35 example, levels of the a2M receptor protein can be detected by immunoassay, levels of a2M 
receptor RNA can be detected by hybridization assays (e.g., Northern blots, in situ- 
hybridization), a2M receptor activity can be assayed by measuring binding activities in vivo 
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or in vitro. Translocations, deletions, and point mutations in a2M receptor nucleic acids can 
be detected by Southern blotting, FISH, RFLP analysis, SSCP, PCR using primers, 
preferably primers that generate a fragment spanning at least most of the cc2M receptor gene, 
sequencing of a2M receptor genomic DNA or cDNA obtained from the patient, etc. 

5 In a preferred embodiment, levels of a2M receptor mRNA or protein in a patient 

sample are detected or measured relative to the levels present in an analogous sample from a 
subject not having the malignancy or hyperproliferative disorder. Decreased levels indicate 
that the subject may develop, or have a predisposition to developing, viral infection, 
malignancy, or hyperproliferative disorder. 

10 In another specific embodiment, diseases and disorders involving a deficient immune 

responsiveness resulting in cell proliferation or in which cell proliferation is desirable for 
treatment, are diagnosed, or their suspected presence can be screened for, or a predisposition 
to develop such disorders can be detected, by detecting increased levels of the cc2M receptor 
protein, a2M receptor RNA, or the a2M receptor functional activity (e.g., HSP binding or 

1 5 a2M receptor antibody, etc.), or by detecting mutations in a2M receptor RNA, DNA or 
protein (e.g., translocations in a2M receptor nucleic acids, truncations in the gene or protein, 
changes in nucleotide or amino acid sequence relative to wild-type a2M receptor) that cause 
increased expression or activity of the a2M receptor. Such diseases and disorders include, 
but are not limited to, those described in Sections 5.7, 5.8, and 5.9. By way of example, 

20 levels of the oc2M receptor protein, levels of cc2M receptor RNA, a2M receptor binding 

activity, and the presence of translocations or point mutations can be determined as described 
above. 

In a specific embodiment, levels of a2M receptor mRNA or protein in a patient 
sample are detected or measured, relative to the levels present in an analogous sample from a 

25 subject not having the disorder, in which increased levels indicate that the subject has, or has 
a predisposition to, an autoimmune disorder. 

Kits for diagnostic use are also provided, that comprise in one or more containers an 
anti-a2M receptor antibody, and, optionally, a labeled binding partner to the antibody. 
Alternatively, the anti-a2M receptor antibody can be labeled (with a detectable marker, e.g., 

30 a chemiluminescent, enzymatic, fluorescent, or radioactive moiety). A kit is also provided 
that comprises in one or more containers a nucleic acid probe capable of hybridizing to a2M 
receptor RNA. In a specific embodiment, a kit can comprise in one or more containers a pair 
of primers (e.g., each in the size range of 6-30 nucleotides) that are capable of priming 
amplification [e.g., by polymerase chain reaction (see e.g., Innis et al, 1990, PCR Protocols, 

35 Academic Press, Inc., San Diego, CA), ligase chain reaction (see EP 320,308) use of QP 
replicase, cyclic probe reaction, or other methods known in the art] under appropriate 
reaction conditions of at least a portion of an a2M receptor nucleic acid. A kit can optionally 
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further comprise in a container a predetermined amount of a purified oc2M receptor protein or 
nucleic acid, e.g., for use as a standard or control. 

5.6 THERAPEUTIC USES 

5 The invention further encompasses methods for modulating the immune response. 

The cc2M receptor recognizes and transports antigenic peptide complexes (e.g., HSP- 
antigenic peptide complexes) for the purpose of presenting such antigenic molecules to cells 
of the immune system and eliciting an immune response. Thus, the compositions and 
methods of the invention may be used for therapeutic treatment of HSP-a2M receptor-related 

10 disorders and conditions, such as autoimmune diseases, cancer and infectious diseases. In 
particular, as described in detail hereinbelow, recombinant cells comprising ct2M receptor 
complexes, such as HSP-antigenic peptide complexes, antibodies and other compounds that 
interact with the a2M receptor, or modulate the interaction between the a2M receptor and its 
ligands, e.g., HSP, as well as other compounds that modulate HSP-a2M receptor-mediated 

15 processes may be used to elicit, or block, an immune response to treat such HSP- a2M 
receptor-related disorders and conditions. 

5.6. 1 THERAPEUTIC USE OF IDENTIFIED AGONISTS AND ANTAGONISTS 
Compounds, such as those identified by screening methods provided herein, that 

20 

interact with the oc2M receptor (herein "cc2MR"), or modulate the interaction between the 
oc2M receptor and its ligand, e.g., HSP, can be useful as therapeutics. Such compounds, 
include, but are not limited to, agonists, antagonists, such as antibodies, antisense RNAs and 
ribozymes Compounds which interfere with ligand (e.g., HSP) -a2M receptor interaction 
can be used to block an immune response, and can be used to treat autoimmune responses 
and conditions. Other antibodies, agonists, antagonists, antisense RNAs and ribozymes may 
upregulate ligand (e.g., HSP)-a2MR interaction, activity, or expression, and would enhance 
the uptake of antigen complexes (e.g., HSP-antigen complexes), and therefore be useful in 
stimulating the host's immune system prior to, or concurrent with, the administration of a 
vaccine. Described below are methods and compositions for the use of such compounds in 
the treatment of HSP-a2M receptor-related disorders, such as immune disorders, 
proliferative disorders, and infectious diseases. 

In one embodiment an antagonist of a2M receptor-ligand (e.g. , HSP-oc2M receptor) 
interaction is used to block the immune response. Such antagonists include compounds that 
interfere with binding of a ligand (e.g., an HSP) to the receptor by competing for binding to 

35 

the oc2M receptor, the ligand, or the ligand-a2M receptor complex. 
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In one embodiment, the antagonist is an antibody specific for the a2M receptor, or a 
fragment thereof which contains the HSP ligand binding site. In another embodiment the 
antagonist is an antibody specific for an HSP, which interferes with binding of the HSP to 
the receptor. 

5 In another embodiment, the antagonist is an peptide which comprises at least 

contiguous 10 amino acids of an HSP sequence. Such a peptide can bind to the ligand 
binding site of the ct2M receptor a block the interaction of an HSP or HSP complex. In 
another embodiment, the antagonist is a peptide which comprises at least contiguous 10 
amino acids of a2M sequence, which, like an HSP, can bind to the a2M receptor and 

IQ interfere with the binding and uptake of HSP-antigen complexes. In yet another embodiment, 
the antagonist is a peptide which comprises at least contiguous 10 amino acids of a2M 
receptor sequence, in particular the ECD of the a2M receptor (or a portion thereof), which 
can bind to and "neutralize" natural ligands, such as HSPs, a2M, LDL, etc. 

Such peptides may be produced synthetically or by using standard molecular biology 

j 5 techniques. Amino acid sequences and nucleotide sequences of naturally occurring a2M 
receptor ligands, such as a2M and HSPs, are generally available in sequence databases, such 
as GenBank. Computer programs, such as Entrez, can be used to browse the database, and 
retrieve any amino acid sequence and genetic sequence data of interest by accession number. 
Methods for recombinant and synthetic production of such peptides are described in Sections 

20 5.1.1 and 5.1.2. 

Additionally, compounds, such as those identified via techniques such as those 
described hereinabove, in Section 5.2, that are capable of modulating a2M receptor gene 
product activity can be administered using standard techniques that are well known to those 
of skill in the art. 

25 

5.6.1.1 COMPETITIVE ANTAGONISTS OF a2MR-LIGAND 
INTERACTIONS 

In one embodiment an antagonist of an a2Mr-ligand (e.g., HSP- oc2M receptor) 
interaction is used to block the immune response to an antigen complex, e.g., to treat an auto- 

30 immune disorder. Such antagonists include molecules that interfere with binding by binding 
to the a2M receptor, thereby interfering with binding of a ligand (e.g., HSP) to the receptor. 
An example of this type of competitive inhibitor is an antibody to a2M receptor, or a 
fragment of a2MR which contains an HSP ligand binding site. Another example of a 
competitive antagonist is cc2M, or a receptor-binding fragment thereof, which itself binds to 

35 a2MR, thereby blocking the binding and uptake of HSP-antigen complexes by the cell. 

An oc2MR-ligand (e.g., HSP) competitive inhibitor can be any type of molecule, 
including but not limited to a protein, nucleic acid or drug. In a preferred embodiment, an 
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HSP-a2M competitive inhibitor is an a2MR-binding or an HSP-binding peptide. Examples 
of such peptides are provided below. 

5.6.1.1.1 a2M RECEPTOR-BINDING PEPTIDES 

5 a Macroglobulin peptides 

In one embodiment of the present invention, an HSP-a2MR competitive antagonist is 
an a macroglobulin, preferably a2M, or a2MR-binding portion thereof. 

Functional expression of cc2M or a2MR-binding portions thereof (including 
recombinant expression as a FX fusion protein, processing, purification and refolding) is 

10 preferably carried out as described by Holtet et al, 1994, FEBS Lett. 344:242-246. 

In a specific mode of the embodiment, an cc2MR-binding portion of oc2M consists of 
or comprises a fragment of the a2M RBD consisting of at least 10 (continuous) amino acids. 
In other modes of the embodiment, the fragment consists of at least 20, 30, 40, 50, 75 or 100 
amino acids of the RBD. In specific modes of the embodiment, such fragments are not larger 

25 than 27, 138 or 153 amino acids. Most preferred peptides comprise one or both of amino 
acids Lys 1370 and Lys 1374 . Such peptides include those consisting of amino acids 1299-1451 
(vRBD in FIG. 7B) (SEQ ID NO:8), 1314-1451 (SEQ ID NO:9) (RBD in FIG. 7B) or 1366- 
1392 (SEQ ID NO: 10) of the mature a2M protein. Other preferred peptides include but are 
not limited to those consisting of amino acids 1300-1425 (SEQ ID NO:ll), 1300-1400 (SEQ 

20 ID NO:12), 1300-1380 (SEQ ID NO:13), 1325-1425 (SEQ ID NO:14), 1325-1400 (SEQ ID 
NO: 15), 1325-1380 (SEQ ID NO: 16), 1350-1425 (SEQ ID NO: 17), 1350-1400 (SEQ ID 
NO:18), or 1350-1380 (SEQ ID NO:19) of the mature human a2M protein. 

Derivatives or analogs of a2M or a2MR-binding portions of cc2M are also 
contemplated as competitive antagonists of HSP-a2MR complexes. Such derivative or 

25 analogs include but are not limited to those molecules comprising regions that are 

substantially homologous to cc2M, the a2M RBD or fragments thereof {e.g. , in various 
embodiments, at least 60% or 70% or 80% or 90% or 95% identity over an amino acid 
sequence of identical size or when compared to an aligned sequence in which the alignment 
is done by a computer homology program known in the art) or whose encoding nucleic acid 

30 is capable of hybridizing to a coding a2M RBD sequence, under stringent, moderately 

stringent, or nonstringent conditions. In certain specific embodiments, an a2M derivative is 
a chimeric or fusion protein comprising an a2M protein or a2MR-binding portion thereof 
(preferably consisting of at least 10 amino acids of the cc2M RBD comprising Lys B70 and 
Lys 1374 ) joined at its amino- or carboxy-terminus via a peptide bond to an amino acid 

3 5 sequence of a different protein. 

In particular, ce2M derivatives can be made by altering a2M coding sequences by 
substitutions, additions or deletions that provide for functionally equivalent molecules. Due 
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to the degeneracy of nucleotide coding sequences, other DNA sequences which encode 
substantially the same amino acid sequence as a a2M gene may be used in the practice of the 
present invention. These include but are not limited to nucleotide sequences comprising all 
or a2MR-binding portions of a2M genes which are altered by the substitution of different 
codons that encode a functionally equivalent amino acid residue within the sequence, thus 
producing a silent change. Likewise, the a2M derivatives of the invention include, but are 
not limited to, those containing, as a primary amino acid sequence, all or an a2MR-binding 
portion of the amino acid sequence of an a2M protein, including altered sequences in which 
functionally equivalent amino acid residues are substituted for residues within the sequence 
resulting in a silent change. For example, one or more amino acid residues within the 
sequence can be substituted by another amino acid of a similar polarity which acts as a 
functional equivalent, resulting in a silent alteration. Substitutes for an amino acid within the 
sequence may be selected from other members of the class to which the amino acid belongs. 
For example, the nonpolar (hydrophobic) amino acids include alanine, leucine, isoleucine, 
valine, proline, phenylalanine, tryptophan and methionine. The polar neutral amino acids 
include glycine, serine, threonine, cysteine, tyrosine, asparagine, and glutamine. The 
positively charged (basic) amino acids include arginine, lysine and histidine. The negatively 
charged (acidic) amino acids include aspartic acid and glutamic acid. 

The oc2M derivatives and analogs of the invention can be produced by various 
methods known in the art. The manipulations which result in their production can occur at 
the gene or protein level. For example, the cloned a2M gene sequence can be modified by 
any of numerous strategies known in the art (Maniatis, T., 1990, Molecular Cloning, A 
Laboratory Manual, 2d ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, New 
York). The sequence can be cleaved at appropriate sites with restriction endonuclease(s), 
followed by further enzymatic modification if desired, isolated, and ligated in vitro. In the 
production of the gene encoding a derivative or analog of a2M, care should be taken to 
ensure that the modified gene remains within the same translational reading frame as a2M, 
uninterrupted by translational stop signals, in the gene region where the desired cc2M activity 
is encoded. 

Manipulations of the oc2M sequence may also be made at the protein level. Included 
within the scope of the invention are oc2M protein fragments or other derivatives or analogs 
which are differentially modified during or after translation, e.g., by glycosylation, 
acetylation, phosphorylation, amidation, derealization by known protecting/blocking groups, 
proteolytic cleavage, linkage to an antibody molecule or other cellular ligand, etc. Any of 
numerous chemical modifications may be carried out by known techniques, including but not 
limited to specific chemical cleavage by cyanogen bromide, trypsin, chymotrypsin, papain, 
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V8 protease, NaBH 4 ; acetylation, formylation, oxidation, reduction; metabolic synthesis in 
the presence of tunicamycin; etc. 

In addition, analogs and derivatives of <x2M can be chemically synthesized. For 
example, an a2MR-binding portion of a2M can be synthesized by use of a peptide 

5 synthesizer. Furthermore, if desired, nonclassical amino acids or chemical amino acid 

analogs can be introduced as a substitution or addition into the a2M sequence. Non-classical 
amino acids include but are not limited to the D-isomers of the common amino acids, a- 
amino isobutyric acid, 4-aminobutyric acid, Abu, 2-amino butyric acid, y-Abu, e-Ahx, 
6-amino hexanoic acid, Aib, 2-amino isobutyric acid, 3 -amino propionic acid, ornithine, 

j q norleucine, norvaline, hydroxyproline, sarcosine, citrulline, cysteic acid, t-butylglycine, t- 
butylalanine, phenylglycine, cyclohexylalanine, (3-alanine, fluoro-amino acids, designer 
amino acids such as P-methyl amino acids, Ccc-methyl amino acids, Noc-methyl amino acids, 
and amino acid analogs in general. Furthermore, the amino acid can be D (dextrorotary) or L 
(levorotary). 

15 In other specific modes of the embodiment, an HSP-oc2MR competitive antagonist is 

another a macroglobulin or a2MR-binding portion thereof, for example an a macro globulin 
RBD domain selected from Nielsen et al, supra, Fig. 3, Group A. 

RAP 

20 In one embodiment of the present invention, an HSP-a2MR competitive antagonist is 

a2MR-associated protein (RAP) (Genbank accession no. A39875) or an a2MR-binding 
portion thereof. In a specific mode of the embodiment, an a2MR-binding portion of RAP 
consists of or comprises a fragment of the RAP RBD consisting of at least 10 (continuous) 
amino acids. In other modes of the embodiment, the fragment consists of at least 20, 30, 40, 

25 50, 75 or 100 amino acids of the RBD. In specific modes of the embodiment, such fragments 
are not larger than 28, 50 or 100 amino acids. In other specific modes of the embodiment, an 
a2MR-binding portion of RAP comprises an a2MR-binding portion of domain 1 or 3, e.g. as 
depicted in Nielsen et al, supra, Fig. 3, Group D or E. Expression of recombinant RAP or 
an a2MR-binding portion thereof, e.g. domain 1 or 3, is preferably achieved as described by 

3 q Andersen et al. , supra). 

5.6.1.1.2 HSP-BINDING PEPTIDES 

a2MR peptides 

In one embodiment of the present invention, an HSP-a2MR competitive antagonist is 
35 cc2MR peptide, preferably a soluble peptide, that can bind to HSPs and therefore 
competitively inhibit HSP binding to the native receptor. 
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Functional expression of HSP-binding portions of a2MR is preferably carried out as 
described for the CR8 domain by Huang et al, 1999, J. Biol. Chem 274:14130-14136. 
Briefly, to maintain proper folding, the protein is expressed as a GST fusion, expressed 
recombinantly, the GST portion cleaved, uncleaved protein removed on GSH-Sepharose, and 

5 cleaved protein refolded. Since the complement repeats bind to calcium, proper folding is 
assayed by measuring the binding of the refolded protein to calcium. 

In a specific mode of the embodiment, an HSP-binding portion of a2MR consists of 
or comprises at least one complement repeat, most preferably selected from CR3-CR10. In 
another specific mode of the embodiment, an HSP-binding portion of a2MR comprises a 

1 0 cluster of complement repeats, most preferably Cl-II. In other modes of the embodiment, the 
HSP-binding portion consists of at least 10, more preferably at least 20, yet more preferably 
at least 30, yet more preferably at least 40, and most preferably at least 80 (continuous) 
amino acids. In specific modes of the embodiment, such fragments are not larger than 40-45 
amino acids. In other specific modes of the embodiment, such fragments are not larger than 

j 5 80-90 amino acids. Exemplary preferred peptides include but are not limited to those 

consisting of amino acids 25-68 (SEQ ID NO:20), 25-1 10 (SEQ ID NO:21), 68-110 (SEQ ID 
NO:22), 853-894 (SEQ ED NO:23), 853-934 (SEQ ID NO:24), 853-974 (SEQ ID NO:25), 
853-1013 (SEQ ID NO:26), 853-1060 (SEQ ID NO:27), 853-1102 (SEQ ID NO:28), 853- 
1183 (SEQ ID NO:29), 895-934 (SEQ ED NO:30), 895-974 (SEQ ID NO:31), 895-1013 

20 (SEQ ID NO:32), 895-1060 (SEQ ED NO:33), 895-1 102 (SEQ ID NO:34), 895-1 183 (SEQ 
ID NO:35), 935-974 (SEQ ID NO:36), 935-1013 (SEQ ID NO:37), 935-1060 (SEQ ID 
NO:38), 935-1102 (SEQ ID NO:39), 935-1183 (SEQ ID NO:40), 975-1013 (SEQ ID 
NO:41), 975-1060 (SEQ ID NO:42), 975-1143 (SEQ ID NO:43), 975-1183 (SEQ ED 
NO:44), 1014-1060 (SEQ ED NO:45), 1014-1102 (SEQ ID NO:46), 1014-1183 (SEQ ED 

25 NO:47), 1061-1102 (SEQ ID NO:48), 1061-1143 (SEQ ID NO:49), 1061-1183 (SEQ ED 
NO:50), 1103-1143 (SEQ IDNO:51), 1103-1183 (SEQ ID NO: 52), or 1144-1183 (SEQ ID 
NO:53) of human a2MR. 

Derivatives or analogs of HSP-binding portions cc2MR also contemplated as 
competitive antagonists of HSP-a2MR complexes. Such derivative or analogs include but 

30 are not limited to those molecules comprising regions that are substantially homologous to 
the extracellular domain of a2MR or fragments thereof (e.g. , in various embodiments, at 
least 60% or 70% or 80% or 90% or 95% identity over an amino acid sequence of identical 
size or when compared to an aligned sequence in which the alignment is done by a computer 
homology program known in the art) or whose encoding nucleic acid is capable of 

35 hybridizing to a sequence encoding an cc2MR HSP-binding sequence, under stringent, 

moderately stringent, or nonstringent conditions. In certain specific embodiments, an cc2MR 
derivative is a chimeric or fusion protein comprising an HSP-binding portion of a2MR, 
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preferably consisting of at least one complement repeat of Cl-II) joined at its amino- or 
carboxy-terminus via a peptide bond to an amino acid sequence of a different protein. Such a 
chimeric protein can be produced recombinantly as described above, by omitting the 
cleavage repurification steps. 

5 Other HSP-binding a2MR derivatives can be made by altering a2MR coding 

sequences by substitutions, additions or deletions that provide for functionally equivalent 
molecules. Due to the degeneracy of nucleotide coding sequences, other DNA sequences 
which encode substantially the same amino acid sequence as an HSP-binding oc2MR gene or 
gene fragment may be used in the practice of the present invention. Selection of suitable 

1 0 alterations and production of HSP-binding a2MR derivatives can be made applying the same 
principles described above for oc2M derivatives and using the general methods described in 
Sections 5.1.1 and 5.1.2. 

HSP peptides 

15 In another mode of the embodiment, the antagonist is an peptide which comprises at 

least contiguous 10 amino acids of an HSP sequence. Such a peptide can bind to the ligand 
binding site of the a2M receptor a block the interaction of an HSP or HSP complex. 

Such peptides may be produced synthetically or by using standard molecular biology 
techniques. Amino acid sequences and nucleotide sequences of naturally occurring HSPs are 

20 generally available in sequence databases, such as GenBank. Computer programs, such as 
Entrez, can be used to browse the database, and retrieve any amino acid sequence and genetic 
sequence data of interest by accession number. Methods for recombinant and synthetic 
production of such peptides are described in Sections 5.1.1 and 5.1.2. 

Additionally, compounds, such as those identified via techniques such as those 

25 described hereinabove, in Section 5.2, that are capable of modulating a2M receptor gene 
product activity can be administered using standard techniques that are well known to those 
of skill in the art. 

5.6.2 THERAPEUTIC USE OF THE a2M RECEPTOR AGAINST CANCER 
AND INFECTIOUS DISEASES 

30 

In another embodiment, symptoms of certain a2M receptor gene disorders, such as 
autoimmune disorders, or proliferative or differentiative disorders causing turnorigenesis or 
cancer, may be ameliorated by modulating the level of a2M receptor gene expression and/or 
a2M receptor gene product activity. In one embodiment, for example, a decrease in cc2M 
35 receptor gene expression may be useful to decrease a2M receptor activity, and ameliorate the 
symptoms of an autoimmune disorder. In this case, the level of a2M receptor gene 
expression may be decreased by using a2M receptor gene sequences in conjunction with 
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well-known antisense, gene "knock-out," ribozyme and/or triple helix methods. In another 
embodiment, an increase in a2M receptor gene expression may be desired to compensate for 
a mutant or impaired gene in an HSP-a2M receptor-mediated pathway, and to ameliorate the 
symptoms of an HSP- cc2M receptor-related disorder. 

5 Among the compounds that may exhibit the ability to modulate the activity, 

expression or synthesis of the a2M receptor gene, including the ability to ameliorate the 
symptoms of an HSP-a2M receptor related disorder are antisense, ribozyme, and triple helix 
molecules. Such molecules may be designed to reduce or inhibit either unimpaired, or if 
appropriate, mutant target gene activity. Techniques for the production and use of such 

10 molecules are well known to those of skill in the art. 

Antisense RNA and DNA molecules act to directly block the translation of mRNA by 
hybridizing to targeted mRNA and preventing protein translation. Antisense approaches 
involve the design of oligonucleotides that are complementary to a target gene mRNA. The 
antisense oligonucleotides will bind to the complementary target gene mRNA transcripts and 

1 5 prevent translation. Absolute complementarity, although preferred, is not required. 

A sequence "complementary" to a portion of an RNA, as referred to herein, means a 
sequence having sufficient complementarity to be able to hybridize with the RNA, forming a 
stable duplex; in the case of double-stranded antisense nucleic acids, a single strand of the 
duplex DNA may thus be tested, or triplex formation may be assayed. The ability to 

20 hybridize will depend on both the degree of complementarity and the length of the antisense 
nucleic acid. Generally, the longer the hybridizing nucleic acid, the more base mismatches 
with an RNA it may contain and still form a stable duplex (or triplex, as the case may be). 
One skilled in the art can ascertain a tolerable degree of mismatch by use of standard 
procedures to determine the melting point of the hybridized complex. 

25 In one embodiment, oligonucleotides complementary to non-coding regions of the 

<x2M receptor gene could be used in an antisense approach to inhibit translation of 
endogenous a2M receptor mRNA. Antisense nucleic acids should be at least six nucleotides 
in length, and are preferably oligonucleotides ranging from 6 to about 50 nucleotides in 
length. In specific aspects the oligonucleotide is at least 10 nucleotides, at least 17 

30 nucleotides, at least 25 nucleotides or at least 50 nucleotides. 

In an embodiment of the present invention, oligonucleotides complementary to the 
nucleic acids encoding the HSP receptor ligand binding domain are used. 

Regardless of the choice of target sequence, it is preferred that in vitro studies are 
first performed to quantitate the ability of the antisense oligonucleotide to inhibit gene 

35 expression. It is preferred that these studies utilize controls that distinguish between antisense 
gene inhibition and nonspecific biological effects of oligonucleotides. It is also preferred that 
these studies compare levels of the target RNA or protein with that of an internal control 
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RNA or protein. Additionally, it is envisioned that results obtained using the antisense 
oligonucleotide are compared with those obtained using a control oligonucleotide. It is 
preferred that the control oligonucleotide is of approximately the same length as the test 
oligonucleotide and that the nucleotide sequence of the oligonucleotide differs from the 
5 antisense sequence no more than is necessary to prevent specific hybridization to the target 
sequence. 

The oligonucleotides can be DNA or RNA or chimeric mixtures or derivatives or 
modified versions thereof, single-stranded or double-stranded. The oligonucleotide can be 
modified at the base moiety, sugar moiety, or phosphate backbone, for example, to improve 

10 stability of the molecule, hybridization, etc. The oligonucleotide may include other 

appended groups such as peptides (e.g. , for targeting host cell receptors in vivo), or agents 
facilitating transport across the cell membrane (see, e.g., Letsinger et al., 1989, Proc. Natl. 
Acad. Sci. U.S.A. 86, 6553-6556; Lemaitre etal., 1987, Proc. Natl. Acad. Sci. 84, 648-652; 
PCT Publication No. WO88/09810, published December 15, 1988) or the blood-brain barrier 

15 (see, e.g., PCT Publication No. WO89/10134, published April 25, 1988), hybridization- 
triggered cleavage agents (see, e.g., Krol et al, 1988, BioTechniques 6, 958-976) or 
intercalating agents (see, e.g., Zon, 1988, Pharm. Res. 5, 539-549). To this end, the 
oligonucleotide may be conjugated to another molecule, e.g., a peptide, hybridization 
triggered cross-linking agent, transport agent, hybridization-triggered cleavage agent, etc. 

20 The antisense oligonucleotide may comprise at least one modified base moiety which 

is selected from the group including but not limited to 5-fluorouracil, 5-bromouracil, 
5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 
5-(carboxyhydroxylmethyl) uracil, 5 -carboxymethylaminomethyl-2-thiouridine, 
5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 

25 N6-isopentenyladenine, 1 -methylguanine, 1-methylinosine, 2,2-dimethylguanine, 

2- methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta- 
D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6- 
isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 

30 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil- 

5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino- 

3- N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. 

The antisense oligonucleotide may also comprise at least one modified sugar moiety 
selected from the group including but not limited to arabinose, 2-fluoroarabinose, xylulose, 
35 and hexose. 

In yet another embodiment, the antisense oligonucleotide comprises at least one 
modified phosphate backbone selected from the group consisting of a phosphorothioate (S- 
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ODNs), a phosphorodithioate, a phosphoramidothioate, a phosphoramidate, a 
phosphordiamidate, a methylphosphonate, an alkyl phosphotriester, and a formacetal or 
analog thereof. 

In yet another embodiment, the antisense oligonucleotide is an a-anomeric 
5 oligonucleotide. An a-anomeric oligonucleotide forms specific double-stranded hybrids 
with complementary RNA in which, contrary to the usual P-units, the strands run parallel to 
each other (Gautier et al., 1987, Nucl. Acids Res. 15, 6625-6641). The oligonucleotide is a 
2'-0-methylribonucleotide (Inoue etal., 1987, Nucl. Acids Res. 15, 6131-6148), or a 
chimeric RNA-DNA analogue (Inoue et al., 1987, FEBS Lett. 215, 327-330). 
2 o Oligonucleotides of the invention may be synthesized by standard methods known in 

the art, e.g. by use of an automated DNA synthesizer (such as are commercially available 
from Biosearch, Applied Biosystems, etc.). As examples, phosphorothioate oligonucleotides 
may be synthesized by the method of Stein et al. (1988, Nucl. Acids Res. 16, 3209), 
methylphosphonate oligonucleotides can be prepared by use of controlled pore glass polymer 
15 supports (Sarin et al, 1988, Proc. Natl. Acad. Sci. U.S.A. 85, 7448-7451), etc. 

While antisense nucleotides complementary to the target gene coding region 
sequence could be used, those complementary to the transcribed, untranslated region are 
most preferred. 

In one embodiment of the present invention, gene expression downregulation is 

20 achieved because specific target mRNAs are digested by RNAse H after they have 

hybridized with the antisense phosphorothioate oligonucleotides (S-ODNs). Since no rules 
exist to predict which antisense S-ODNs will be more successful, the best strategy is 
completely empirical and consists of trying several antisense S-ODNs. Antisense 
phosphorothioate oligonucleotides (S-ODNs) will be designed to target specific regions of 

25 mRNAs of interest. Control S-ODNs consisting of scrambled sequences of the antisense S- 
ODNs will also be designed to assure identical nucleotide content and minimize differences 
potentially attributable to nucleic acid content. All S-ODNs can be synthesized by Oligos 
Etc. (Wilsonville, OR). In order to test the effectiveness of the antisense molecules when 
applied to cells in culture, such as assays for research purposes or ex vivo gene therapy 

30 protocols, cells will be grown to 60-80% confluence on 100 mm tissue culture plates, rinsed 
with PBS and overlaid with lipofection mix consisting of 8 ml Opti-MEM, 52.8 /A 
Lipofectin, and a final concentration of 200 nM S-ODNs. Lipofections will be carried out 
using Lipofectin Reagent and Opti-MEM (Gibco BRL). Cells will be incubated in the 
presence of the lipofection mix for 5 hours. Following incubation the medium will be 

35 replaced with complete DMEM. Cells will be harvested at different time points post- 
lipofection and protein levels will be analyzed by Western blot. 
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Antisense molecules should be targeted to cells that express the target gene, either 
directly to the subject in vivo or to cells in culture, such as in ex vivo gene therapy protocols. 
A number of methods have been developed for delivering antisense DNA or RNA to cells; 
e.g., antisense molecules can be injected directly into the tissue site, or modified antisense 

5 molecules, designed to target the desired cells {e.g., antisense linked to peptides or antibodies 
that specifically bind receptors or antigens expressed on the target cell surface) can be 
administered systemically. 

However, it is often difficult to achieve intracellular concentrations of the antisense 
sufficient to suppress translation of endogenous mRNAs. Therefore a preferred approach 

10 utilizes a recombinant DNA construct in which the antisense oligonucleotide is placed under 
the control of a strong pol III or pol II promoter. The use of such a construct to transfect 
target cells in the patient will result in the transcription of sufficient amounts of single 
stranded RNAs that will form complementary base pairs with the endogenous target gene 
transcripts and thereby prevent translation of the target gene mRNA. For example, a vector 

1 5 can be introduced e.g. , such that it is taken up by a cell and directs the transcription of an 
antisense RNA. Such a vector can remain episomal or become chromosomally integrated, as 
long as it can be transcribed to produce the desired antisense RNA. Such vectors can be 
constructed by recombinant DNA technology methods standard in the art. Vectors can be 
plasmid, viral, or others known in the art, used for replication and expression in mammalian 

20 cells. Expression of the sequence encoding the antisense RNA can be by any promoter 
known in the art to act in mammalian, preferably human cells. Such promoters can be 
inducible or constitutive. Such promoters include but are not limited to: the SV40 early 
promoter region (Bernoist and Chambon, 1981, Nature 290, 304-310), the promoter 
contained in the 3' long terminal repeat of Rous sarcoma virus (Y amamoto et al, 1980, Cell 

25 22, 787-797), the herpes thymidine kinase promoter (Wagner et al, 1981, Proc. Natl. Acad. 
Sci. U.S.A. 78, 1441-1445), the regulatory sequences of the metallothionein gene (Brinster et 
al, 1982, Nature 296, 39-42), etc. Any type of plasmid, cosmid, YAC or viral vector can be 
used to prepare the recombinant DNA construct which can be introduced directly into the 
tissue site. Alternatively, viral vectors can be used that selectively infect the desired tissue, 

30 in which case administration may be accomplished by another route {e.g., systemically). 

Ribozyme molecules designed to catalytically cleave target gene mRNA transcripts 
can also be used to prevent translation of target gene mRNA and, therefore, expression of 
target gene product. (See, e.g., PCT International Publication WO90/1 1364, published 
October 4, 1990; Sarver et al, 1990, Science 247, 1222-1225). In an embodiment of the 

35 present invention, oligonucleotides which hybridize to the HSP receptor gene are designed to 
be complementary to the nucleic acids encoding the HSP receptor ligand binding domain. 
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Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage 
of RNA. (For a review, see Rossi, 1994, Current Biology 4, 469-471). The mechanism of 
ribozyme action involves sequence specific hybridization of the ribozyme molecule to 
complementary target RNA, followed by an endonucleolytic cleavage event. The 

5 composition of ribozyme molecules must include one or more sequences complementary to 
the target gene mRNA, and must include the well known catalytic sequence responsible for 
mRNA cleavage. For this sequence, see, e.g., U.S. Patent No. 5,093,246, which is 
incorporated herein by reference in its entirety. 

While ribozymes that cleave mRNA at site specific recognition sequences can be 

10 used to destroy target gene mRNAs, the use of hammerhead ribozymes is preferred. 

Hammerhead ribozymes cleave mRNAs at locations dictated by flanking regions that form 
complementary base pairs with the target mRNA. The sole requirement is that the target 
mRNA have the following sequence of two bases: 5-UG-3'. The construction and 
production of hammerhead ribozymes is well known in the art and is described more fully in 

1 5 Myers, 1995, Molecular Biology and Biotechnology: A Comprehensive Desk Reference, 
VCH Publishers, New York, (see especially fig. 4, p. 833) and in Haseloff & Gerlach, 1988, 
Nature, 334, 585-591, which is incorporated herein by reference in its entirety. 

Preferably the ribozyme is engineered so that the cleavage recognition site is located 
near the 5' end of the target gene mRNA, i.e., to increase efficiency and minimize the 

20 intracellular accumulation of non-functional mRNA transcripts. 

The ribozymes of the present invention also include RNA endoribonucleases 
(hereinafter "Cech-type ribozymes") such as the one that occurs naturally in Tetrahymena 
thermophila (known as the IVS, or L-19 IVS RNA) and that has been extensively described 
by Thomas Cech and collaborators (Zaug et a!., 1984, Science, 224, 574-578; Zaug and 

25 Cech, 1986, Science, 231, 470-475; Zaug et al, 1986, Nature, 324, 429-433; published 

International patent application No. WO 88/04300 by University Patents Inc.; Been & Cech, 
1986, Cell, 47, 207-216). The Cech-type ribozymes have an eight base pair active site which 
hybridizes to a target RNA sequence whereafter cleavage of the target RNA takes place. The 
invention encompasses those Cech-type ribozymes which target eight base-pair active site 

30 sequences that are present in the target gene. 

As in the antisense approach, the ribozymes can be composed of modified 
oligonucleotides {e.g., for improved stability, targeting, etc.) and should be delivered to cells 
that express the target gene in vivo. A preferred method of delivery involves using a DNA 
construct "encoding" the ribozyme under the control of a strong constitutive pol III or pol II 

35 promoter, so that transfected cells will produce sufficient quantities of the ribozyme to 

destroy endogenous target gene messages and inhibit translation. Because ribozymes unlike 
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antisense molecules, are catalytic, a lower intracellular concentration is required for 
efficiency. 

Endogenous target gene expression can also be reduced by inactivating or "knocking 
out" the target gene or its promoter using targeted homologous recombination (e.g., see 

5 Smithies et al, 1985, Nature 317, 230-234; Thomas & Capecchi, 1987, Cell 51, 503-512; 
Thompson et al, 1989, Cell 5, 313-321; each of which is incorporated by reference herein in 
its entirety). For example, a mutant, non-functional target gene (or a completely unrelated 
DNA sequence) flanked by DNA homologous to the endogenous target gene (either the 
coding regions or regulatory regions of the target gene) can be used, with or without a 

10 selectable marker and/or a negative selectable marker, to transfect cells that express the target 
gene in vivo. Insertion of the DNA construct, via targeted homologous recombination, 
results in inactivation of the target gene. Such approaches are particularly suited 
modifications to ES (embryonic stem) cells can be used to generate animal offspring with an 
inactive target gene (e.g., see Thomas & Capecchi, 1987 and Thompson, 1989, supra). 

j 5 However this approach can be adapted for use in humans provided the recombinant DNA 
constructs are directly administered or targeted to the required site in vivo using appropriate 
viral vectors. 

Alternatively, endogenous target gene expression can be reduced by targeting 
deoxyribonucleotide sequences complementary to the regulatory region of the target gene 

20 (i-e., the target gene promoter and/or enhancers) to form triple helical structures that prevent 
transcription of the target gene in target cells in the body. (See generally, Helene, 1991, 
Anticancer Drug Des., 6(6), 569-584; Helene et al, 1992, Ann. N.Y. Acad. Sci., 660, 27-36; 
and Maher, 1992, Bioassays 14(12), 807-815). 

Nucleic acid molecules to be used in triple helix formation for the inhibition of 

25 transcription should be single stranded and composed of deoxyribonucleotides. The base 
composition of these oligonucleotides must be designed to promote triple helix formation via 
Hoogsteen base pairing rules, which generally require sizeable stretches of either purines or 
pyrimidines to be present on one strand of a duplex. Nucleotide sequences may be 
pyrimidine-based, which will result in TAT and CGC + triplets across the three associated 

30 strands of the resulting triple helix. The pyrimidine-rich molecules provide base 
complementarity to a purine-rich region of a single strand of the duplex in a parallel 
orientation to that strand. In addition, nucleic acid molecules may be chosen that are purine- 
rich, for example, contain a stretch of G residues. These molecules will form a triple helix 
with a DNA duplex that is rich in GC pairs, in which the majority of the purine residues are 

35 located on a single strand of the targeted duplex, resulting in GGC triplets across the three 
strands in the triplex. 
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Alternatively, the potential sequences that can be targeted for triple helix formation 
may be increased by creating a so called "switchback" nucleic acid molecule. Switchback 
molecules are synthesized in an alternating 5'-3', 3'-5' manner, such that they base pair with 
first one strand of a duplex and then the other, eliminating the necessity for a sizeable stretch 

5 of either purines or pyrimidines to be present on one strand of a duplex. 

In instances wherein the antisense, ribozyme, and/or triple helix molecules described 
herein are utilized to inhibit mutant gene expression, it is possible that the technique may so 
efficiently reduce or inhibit the transcription (triple helix) and/or translation (antisense, 
ribozyme) of mRNA produced by normal target gene alleles that the possibility may arise 

10 wherein the concentration of normal target gene product present may be lower than is 

necessary for a normal phenotype. In such cases, to ensure that substantially normal levels of 
target gene activity are maintained, therefore, nucleic acid molecules that encode and express 
target gene polypeptides exhibiting normal target gene activity may, be introduced into cells 
via gene therapy methods such as those described, below, in Section 5.6.3 that do not contain 

j5 sequences susceptible to whatever antisense, ribozyme, or triple helix treatments are being 
utilized. Alternatively, in instances whereby the target gene encodes an extracellular protein, 
it may be preferable to co-administer normal target gene protein in order to maintain the 
requisite level of target gene activity. 

Anti-sense RNA and DNA, ribozyme, and triple helix molecules of the invention may 

20 be prepared by any method known in the art for the synthesis of DNA and RNA molecules, 
as discussed above. These include techniques for chemically synthesizing oligodeoxyri- 
bonucleotides and oligoribonucleotides well known in the art such as for example solid phase 
phosphoramidite chemical synthesis. Alternatively, RNA molecules may be generated by in 
vitro and in vivo transcription of DNA sequences encoding the antisense RNA molecule. 

25 Such DNA sequences may be incorporated into a wide variety of vectors that incorporate 
suitable RNA polymerase promoters such as the T7 or SP6 polymerase promoters. 
Alternatively, antisense cDNA constructs that synthesize antisense RNA constitutively or 
inducibly, depending on the promoter used, can be introduced stably into cell lines. 



5.6.3 GENE REPLACEMENT THERAPY 

With respect to an increase in the level of normal a2M receptor gene expression 
and/or a2M receptor gene product activity, a2M receptor gene nucleic acid sequences can, 
for example, be utilized for the treatment of immune disorders resulting in proliferative 
disorders such as cancer. Such treatment can be administered, for example, in the form of 
gene replacement therapy. Specifically, one or more copies of a normal cc2M receptor gene 
or a portion of the a2M receptor gene that directs the production of an cc2M receptor gene 
product exhibiting normal a2M receptor gene function, may be inserted into the appropriate 
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cells within a patient, using vectors that include, but are not limited to adenovirus, adeno- 
associated virus, and retrovirus vectors, in addition to other particles that introduce DNA into 
cells, such as liposomes. 

Gene replacement therapy techniques should be capable of delivering a2M receptor 

5 gene sequences to cell types that express the HSP receptor within patients. Thus, in one 
embodiment, techniques that are well known to those of skill in the art (see, e.g., PCT 
Publication No. WO89/10134, published April 25, 1988) can be used to enable a2M receptor 
gene sequences to be delivered to developing cells of the myeloid lineage, for example, to the 
bone marrow. In another specific embodiment, gene replacement can be accomplished using 

!0 macrophages in vitro, and delivered to a patient using the techniques of adoptive 
immunotherapy. 

In another embodiment, techniques for delivery involve direct administration of such 
cc2M receptor gene sequences to the site of the cells in which the a2M receptor gene 
sequences are to be expressed, e.g., directly at the site of the tumor. 

j 5 Additional methods that may be utilized to increase the overall level of a2M receptor 

gene expression and/or oc2M receptor gene product activity include the introduction of 
appropriate cc2M receptor-expressing cells, preferably autologous cells, into a patient at 
positions and in numbers that are sufficient to ameliorate the symptoms of an cc2M receptor 
disorder. Such cells may be either recombinant or non-recombinant. 

20 Among the cells that can be administered to increase the overall level of a2M 

receptor gene expression in a patient are cells that normally express the a2M receptor gene. 

Alternatively, cells, preferably autologous cells, can be engineered to express a2M 
receptor gene sequences, and may then be introduced into a patient in positions appropriate 
for the amelioration of the symptoms of an a2M receptor disorder or a proliferative or viral 

25 disease, e.g., cancer and tumorigenesis. Alternately, cells that express an unimpaired a2M 
receptor gene and that are from a MHC matched individual can be utilized, and may include, 
for example, brain cells. The expression of the a2M receptor gene sequences is controlled 
by the appropriate gene regulatory sequences to allow such expression in the necessary cell 
types. Such gene regulatory sequences are well known to the skilled artisan. Such cell-based 

30 gene therapy techniques are well known to those skilled in the art, see, e.g., Anderson, U.S. 
Patent No. 5,399,349. 

When the cells to be administered are non-autologous cells, they can be administered 
using well known techniques that prevent a host immune response against the introduced 
cells from developing. For example, the cells may be introduced in an encapsulated form 

35 which, while allowing for an exchange of components with the immediate extracellular 
environment, does not allow the introduced cells to be recognized by the host immune 
system. 
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5 .6.4 DELIVERY OF SOLUBLE oc2M RECEPTOR POLYPEPTIDES 
Genetically engineered cells that express soluble a2M receptor ECDs or fusion 

proteins, e.g., fusion Ig molecules can be administered in vivo where they may function as 
"bioreactors" that deliver a supply of the soluble molecules. Such soluble a2M receptor 
5 polypeptides and fusion proteins, when expressed at appropriate concentrations, should 
neutralize or "mop up" HSPs or other native ligand for the a2M receptor, and thus act as 
inhibitors of cc2M receptor activity and may therefore be used to treat HSP-a2M receptor- 
related disorders and diseases, such as autoimmune disorders, proliferative disorders, and 
infectious diseases. 

10 

5.6.5 DELIVERY OF DOMINANT NEGATIVE MUTANTS 

In another embodiment of the invention, dominant negative mutants ("dominant 
negatives") may be used therapeutically to block the immune response to an HSP-antigen 
complex, e.g., to treat an auto-immune disorder. In general, such dominant-negatives are 

1 5 mutants which, when expressed, interact with ligand (i. e. , HSP-antigenic molecule 
complex), but lack one or more functions, i.e. endocytotic functions and/or signaling 
functions, of normal cc2MR. Such mutants interfere with the function of normal a2MR in 
the same cell or in a different cell, e.g. by titration of HSP-peptide complexes from the wild 
type receptor. Such a mutation, for example, can be one or more point mutation(s), a 

20 deletion, insertion, or other mutation in either the extracellular of the 5 15 kDa subunit, or the 
extracellular, transmembrane or intracellular domains of the 85 kDa subunit of the alpha(2) 
macroglobulin receptor {see Krieger and Herz, 1994, Annu. Rev. Biochem 63:601-637 for 
oc2MR subunit configuration). However, in construction of dominant negative mutations in 
the either subunit, care should be taken to ensure that the cleavage domain (signaling 

25 cleavage between aas 3525 and 3526 of the precursor of oc2MR) remains intact so that the 
5 1 5 kDa subunit is processed and presented on the cell surface. Additionally, care should be 
taken to ensure that the domains by which the two subunits associate should also remain 
functional. For example, in a specific embodiment, the C-terminal intracellular domain of 
the 85 kDa subunit is truncated. In another embodiment, a point mutation on the N-terminal 

30 515 kDa subunit blocks endocytosis but not ligand binding. In another embodiment, the N- 
terminal 515 kDa subunit is expressed as a fusion protein, wherein the C-terminus of said 
fusion protein is the transmembrane domain and optionally the intracellular domain, of 
another Type I single transmembrane receptor. 

Expression of a such a dominant negative mutation in cell can block uptake of ligand 

35 by normal functional receptors in the same or neighboring cells by titrating out the amount of 
available ligand. Thus, a recombinant antigen presenting cell expressing such a dominant 
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negative can be used to titrate out HSP-antigenic molecule complexes when administered to 
a patient in need of treatment for an autoimmune disorder. 

5 . 7 TARGET AUTOIMMUNE DISEASES 

5 Autoimmune diseases that can be treated by the methods of the present invention 

include, but are not limited to, insulin dependent diabetes mellitus {i.e., IDDM, or 
autoimmune diabetes), multiple sclerosis, systemic lupus erythematosus, Sjogren's 
syndrome, scleroderma, polymyositis, chronic active hepatitis, mixed connective tissue 
disease, primary biliary cirrhosis, pernicious anemia, autoimmune thyroiditis, idiopathic 

10 Addison's disease, vitiligo, gluten-sensitive enteropathy, Graves' disease, myasthenia gravis, 
autoimmune neutropenia, idiopathic thrombocytopenia purpura, rheumatoid arthritis, 
cirrhosis, pemphigus vulgaris, autoimmune infertility, Goodpasture's disease, bullous 
pemphigoid, discoid lupus, ulcerative colitis, and dense deposit disease. The diseases set 
forth above, as referred to herein, include those exhibited by animal models for such 

j 5 diseases, such as, for example non-obese diabetic (NOD) mice for IDDM and experimental 
autoimmune encephalomyelitis (EAE) mice for multiple sclerosis. 

The methods of the present invention can be used to treat such autoimmune diseases 
by reducing or eliminating the immune response to the patient's own (self) tissue, or, 
alternatively, by reducing or eliminating a pre-existing autoimmune response directed at 

20 tissues or organs transplanted to replace self tissues or organs damaged by the autoimmune 
response. 

5.8 TARGET INFECTIOUS DISEASES 

The infectious diseases that can be treated or prevented using the methods and 
25 compositions of the present invention include those caused by intracellular pathogens such as 
viruses, bacteria, protozoans, and intracellular parasites. Viruses include, but are not limited 
to viral diseases such as those caused by hepatitis type B virus, parvoviruses, such as 
adeno-associated virus and cytomegalovirus, papovaviruses such as papilloma virus, 
polyoma viruses, and SV40, adenoviruses, herpes viruses such as herpes simplex type I 
30 (HSV-I), herpes simplex type II (HSV-II), and Epstein-Barr virus, poxviruses, such as 
variola (smallpox) and vaccinia virus, RNA viruses, including but not limited to human 
immunodeficiency virus type I (HIV-I), human immunodeficiency virus type II (HIV-II), 
human T-cell lymphotropic virus type I (HTLV-I), and human T-cell lymphotropic virus 
type II (HTLV-II); influenza virus, measles virus, rabies virus, Sendai virus, picornaviruses 
3 5 such as poliomyelitis virus, coxsackieviruses, rhino viruses, reo viruses, togaviruses such as 
rubella virus (German measles) and Semliki forest virus, arboviruses, and hepatitis type A 
virus. 
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In another embodiment, bacterial infections can be treated or prevented such as, but 
not limited to disorders caused by pathogenic bacteria including, but not limited to, 
Streptococcus pyogenes, Streptococcus pneumoniae, Neisseria gonorrhoea, Neisseria 
meningitidis, Corynebacterium diphtheriae, Clostridium botulinum, Clostridium perfringens, 

5 Clostridium tetani, Haemophilus influenzae, Klebsiella pneumoniae, Klebsiella ozaenae, 
Klebsiella rhinoscleromotis, Staphylococcus aureus, Vibrio cholerae, Escherichia coli, 
Pseudomonas aeruginosa, Campylobacter (Vibrio) fetus , Campylobacter jejuni, Aeromonas 
hydrophila, Bacillus cereus, Edwardsiella tarda, Yersinia enterocolitica, Yersinia pestis, 
Yersinia pseudotuberculosis, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, 

10 Salmonella typhiimurium, Salmonella typhii, Treponema pallidum, Treponema pertenue, 
Treponema carateneum, Borrelia vincentii, Borrelia burgdorferi, Leptospira 
icterohemorrhagiae, Mycobacterium tuberculosis, Toxoplasma gondii, Pneumocystis carinii, 
Francisella tularensis, Brucella abortus, Brucella suis, Brucella melitensis, Mycoplasma 
spp., Rickettsia prowazeki, Rickettsia tsutsugumushi, Chlamydia spp., and Helicobacter 

15 pylori. 

In another preferred embodiment, the methods can be used to treat or prevent 
infections caused by pathogenic protozoans such as, but not limited to, Entomoeba 
histolytica, Trichomonas tenas, Trichomonas hominis, Trichomonas vaginalis, Trypanosoma 
gambiense, Trypanosoma rhodesiense, Trypanosoma cruzi, Leishmania donovani, 
20 Leishmania tropica, Leishmania braziliensis, Pneumocystis pneumonia, Plasmodium vivax, 
Plasmodium falciparum, and Plasmodium malaria. 

5 .9 TARGET PROLIFERATIVE CELL DISORDERS 

With respect to specific proliferative and oncogenic disease associated with HSP- 

25 cc2M receptor activity, the diseases that can be treated or prevented by the methods of the 
present invention include, but are not limited to: human sarcomas and carcinomas, e.g., 
fibrosarcoma, myxosarcoma, liposarcoma, chondrosarcoma, osteogenic sarcoma, chordoma, 
angiosarcoma, endotheliosarcoma, lymphangiosarcoma, lymphangioendotheliosarcoma, 
synovioma, mesothelioma, Ewing's tumor, leiomyosarcoma, rhabdomyosarcoma, colon 

30 carcinoma, pancreatic cancer, breast cancer, ovarian cancer, prostate cancer, squamous cell 
carcinoma, basal cell carcinoma, adenocarcinoma, sweat gland carcinoma, sebaceous gland 
carcinoma, papillary carcinoma, papillary adenocarcinomas, cystadenocarcinoma, medullary 
carcinoma, bronchogenic carcinoma, renal cell carcinoma, hepatoma, bile duct carcinoma, 
choriocarcinoma, seminoma, embryonal carcinoma, Wilms' tumor, cervical cancer, testicular 

35 tumor, lung carcinoma, small cell lung carcinoma, bladder carcinoma, epithelial carcinoma, 
glioma, astrocytoma, medulloblastoma, craniopharyngioma, ependymoma, pinealoma, 
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hemangioblastoma, acoustic neuroma, oligodendroglioma, meningioma, melanoma, 
neuroblastoma, retinoblastoma; leukemias, e.g., acute lymphocytic leukemia and acute 
myelocytic leukemia (myeloblastic, promyelocytic, myelomonocytic, monocytic and 
erythroleukemia); chronic leukemia (chronic myelocytic (granulocytic) leukemia and chronic 
lymphocytic leukemia); and polycythemia vera, lymphoma (Hodgkin's disease and non- 
Hodgkin's disease), multiple myeloma, Waldenstrom's macroglobulinemia, and heavy chain 
disease. 

Diseases and disorders involving a deficiency in cell proliferation or in which cell 
proliferation is desired for treatment or prevention, and that can be treated or prevented by 
inhibiting the a2M receptor function, include but are not limited to degenerative disorders, 
growth deficiencies, hypoproliferative disorders, physical trauma, lesions, and wounds; for 
example, to promote wound healing, or to promote regeneration in degenerated, lesioned or 
injured tissues, etc. 



5.10 PHARMACEUTICAL PREPARATIONS AND METHODS OF 
ADMINISTRATION 

The compounds that are determined to affect a2M receptor gene expression or gene 
product activity can be administered to a patient at therapeutically effective doses to treat or 
ameliorate a cell proliferative disorder. A therapeutically effective dose refers to that amount 
of the compound sufficient to result in amelioration of symptoms of such a disorder. 



5.10.1 EFFECTIVE DOSE 

Toxicity and therapeutic efficacy of such compounds can be determined by standard 
pharmaceutical procedures in cell cultures or experimental animals, e.g., for determining the 
LD 50 (the dose lethal to 50% of the population) and the ED 50 (the dose therapeutically 
effective in 50% of the population). The dose ratio between toxic and therapeutic effects is 
the therapeutic index and it can be expressed as the ratio LD 50 /ED 50 . Compounds that exhibit 
large therapeutic indices are preferred. While compounds that exhibit toxic side effects may 
be used, care should be taken to design a delivery system that targets such compounds to the 
site of affected tissue in order to minimize potential damage to uninfected cells and, thereby, 
reduce side effects. 

The data obtained from the cell culture assays and animal studies can be used in 
formulating a range of dosage for use in humans. The dosage of such compounds lies 
preferably within a range of circulating concentrations that include the ED 50 with little or no 
toxicity. The dosage may vary within this range depending upon the dosage form employed 
and the route of administration utilized. For any compound used in the method of the 
invention, the therapeutically effective dose can be estimated initially from cell culture 
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assays. A dose may be formulated in animal models to achieve a circulating plasma 
concentration range that includes the IC 50 (i.e., the concentration of the test compound that 
achieves a half-maximal inhibition of symptoms) as determined in cell culture. Such 
information can be used to more accurately determine useful doses in humans. Levels in 
5 plasma may be measured, for example, by high performance liquid chromatography. 

5.10.2 FORMULATIONS AND USE 

Pharmaceutical compositions for use in accordance with the present invention may be 
formulated in conventional manner using one or more physiologically acceptable carriers or 
IQ excipients. 

Thus, the compounds and their physiologically acceptable salts and solvates may be 
formulated for administration by inhalation or insufflation (either through the mouth or the 
nose) or oral, buccal, parenteral or rectal administration. 

For oral administration, the pharmaceutical compositions may take the form of, for 

15 example, tablets or capsules prepared by conventional means with pharmaceutically 
acceptable excipients such as binding agents (e.g., pregelatinised maize starch, 
polyvinylpyrrolidone or hydroxypropyl methylcellulose); fillers (e.g., lactose, 
microcrystalline cellulose or calcium hydrogen phosphate); lubricants (e.g., magnesium 
stearate, talc or silica); disintegrants (e.g., potato starch or sodium starch glycolate); or 

20 wetting agents (e.g., sodium lauryl sulphate). The tablets may be coated by methods well 
known in the art. Liquid preparations for oral administration may take the form of, for 
example, solutions, syrups or suspensions, or they may be presented as a dry product for 
constitution with water or other suitable vehicle before use. Such liquid preparations may be 
prepared by conventional means with pharmaceutically acceptable additives such as 

25 suspending agents (e.g. , sorbitol syrup, cellulose derivatives or hydrogenated edible fats); 
emulsifying agents (e.g., lecithin or acacia); non-aqueous vehicles (e.g., almond oil, oily 
esters, ethyl alcohol or fractionated vegetable oils); and preservatives (e.g., methyl or propyl- 
p-hydroxybenzoates or sorbic acid). The preparations may also contain buffer salts, 
flavoring, coloring and sweetening agents as appropriate. 

30 Preparations for oral administration may be suitably formulated to give controlled 

release of the active compound. 

For buccal administration the compositions may take the form of tablets or lozenges 
formulated in conventional manner. 

For administration by inhalation, the compounds for use according to the present 

35 invention are conveniently delivered in the form of an aerosol spray presentation from 
pressurized packs or a nebuliser, with the use of a suitable propellant, e.g., 
dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide 
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or other suitable gas. In the case of a pressurized aerosol the dosage unit may be determined 
by providing a valve to deliver a metered amount. Capsules and cartridges of e.g., gelatin for 
use in an inhaler or insufflator may be formulated containing a powder mix of the compound 
and a suitable powder base such as lactose or starch. 

5 The compounds may be formulated for parenteral administration by injection, e.g. , by 

bolus injection or continuous infusion. Formulations for injection may be presented in unit 
dosage form, e.g., in ampoules or in multi-dose containers, with an added preservative. The 
compositions may take such forms as suspensions, solutions or emulsions in oily or aqueous 
vehicles, and may contain formulatory agents such as suspending, stabilizing and/or 

10 dispersing agents. Alternatively, the active ingredient may be in powder form for 
constitution with a suitable vehicle, e.g., sterile pyrogen- free water, before use. 

The compounds may also be formulated in rectal compositions such as suppositories 
or retention enemas, e.g., containing conventional suppository bases such as cocoa butter or 
other glycerides. 

15 In addition to the formulations described previously, the compounds may also be 

formulated as a depot preparation. Such long acting formulations may be administered by 
implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. 
Thus, for example, the compounds may be formulated with suitable polymeric or 
hydrophobic materials (for example as an emulsion in an acceptable oil) or ion exchange 

20 resins, or as sparingly soluble derivatives, for example, as a sparingly soluble salt. 

The compositions may, if desired, be presented in a pack or dispenser device that may 
contain one or more unit dosage forms containing the active ingredient. The pack may for 
example comprise metal or plastic foil, such as a blister pack. The pack or dispenser device 
may be accompanied by instructions for administration. 

25 

6. EXAMPLE: IDENTIFICATION OF a2M RECEPTOR AS AN HSP RECEPTOR 

6.1 INTRODUCTION 

^ The Example presented herein describes the successful identification of an interaction 

between gp96 and the a2M receptor present in macrophages and dendritic cells. The 
experiments presented herein form the basis for isolating a2M receptor polypeptides and for 
the screening, diagnostic, and therapeutic methods of the present invention. 

The Applicant of the present invention noted that certain observations were 

^ inconsistent with a "direct transfer" model of HSP-chaperoned peptide antigen presentation. 
First, the immunogenicity of HSP preparations is dependent on the presence of functional 
phagocytic cells but not B cells or other nonprofessional antigen-presenting cells, (Udono 
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and Srivastava, 1993, supra; Suto and Srivastava, 1995, supra), whereas free peptides can 
sensitize all cell types. Second, extremely small quantities of HSP-peptide complexes were 
effective in eliciting specific immunity, i.e., gp96-chaperoned peptides are several hundred 
times as effective as free peptides in sensitizing macrophages for CTL recognition, 

5 suggesting the possibility of a specific uptake mechanism. Third, gp96-chaperoned peptides 
elicited an MHC I response that was not limited by the size of peptide. Finally, the 
processing of gp96-peptide complexes in macrophage was found to be sensitive to Brefeldin 
A (BFA), which blocks transport through the Golgi apparatus, suggesting that processing 
occurred through an intercellular mechanism. These observations led to the hypothesis that 

IQ HSP-chaperoned peptides may be processed internally and re-presented by MHC class I 
molecules on the cell surfaces of macrophages (Suto and Srivastava, 1995, supra). There is 
also the hypothesis that the mannose receptor is used in the uptake of gp96 but no 
mechanism has been proposed for the non-glycosylated HSPs, such as HSP70 (Ciupitu et al, 
1998, J. Exp. Med., 187: 685-691). Others suggested that a novel intracellular trafficking 

15 pathway may be involved for the transport of peptides from the extracellular medium into the 
lumen of ER) Day et al, 1997, Proc. Natl. Acad. Sci. 94:8065-8069; Nicchitta, 1998, Curr. 
Opin. in Immunol. 10:103-109). Further suggestions include the involvement of phagocytes 
which (a) possess an ill-defined pathway to shunt protein from the phagosome into 
thecytosol where it would enter the normal class I pathway; (b) digest ingested material in 

20 lysosomes and regurgitate peptides for loading on the surface to class I molecules (Bevan, 
1995, J. Exp. Med. 192:639-41). The discovery of a receptor for heat shock protein as 
disclosed herein helps to resolve the paradox of how extracellular antigenic peptides 
complexed to HSPs can be presented by MHC class I molecules on antigen presenting cells. 

25 6.2 MATERIALS AND METHODS 

Affinity chromatography. Proteins (lmg) in 2ml volume were incubated with 2ml of 
equilibrated AminoLink beads in PBS with a reductant (NaCNBH 3 ) for 1 hour. Uncoupled 
protein was removed by extensive washing of the column and unreactive groups quenched. 
Immobilization yields were typically >92% of the starting amount of protein. Columns were 

30 stored at 4°C until used. Such columns were made with gp96 (purified as described in 
Srivastava et al, 1986, Proc. Natl. Acad. Sci., U.S.A. 83:3407-341 1) and albumin. For 
membrane purification, cells were lysed by dounce homogenization in hypotonic buffer 
containing PMSF. Unlyzed cells and nuclei were removed by centrifugation at lOOOg for 5 
mm. The postnuclear supernatant was centrifuged at 100,000g for 90 mins. The pellet 

35 contains total membranes and was fractionated by aqueous two-phase partition with a 
dextran/polyethylene glycol biphase. Briefly membranes were resuspended in PEG (33% 
wt/wt in 0.22 M sodium phosphate buffer, pH 6.5) and underlaid gently with dextran 
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(20%wt/wt in 0.22M sodium phosphate buffer, pH 6.5). The two phases were mixed gently 
and centrifuged at 2000 g for 15 mins. The white material at the interphase was enriched for 
plasma membranes, whose proteins were extracted by 2 hr incubation in 20mM Tris buffer 
(pH8, containing 0.08% octylglucoside) at 4°C. 

5 Photo cross-linking of gp96 to putative receptor. The cross-linker (SASD, (Pierce) 

was labeled with I 125 using iodobeads (Pierce). Radiolabeled SASD was covalently attached 
to gp96 by incubation at room temperature for 1 hr. Free SASD and I 125 were removed by 
size exclusion column (KwikSep columns, Pierce). For cross-linking studies, I 125 -SASD- 
gp96 (50 ug gp96) was incubated with purified CD1 lb + cells. Unbound protein was 

IQ removed by washing. All procedures to this point were carried out in very dim light. 
Proteins were cross-linked with UV light. Cells were lysed with lysis buffer (0.5%NP4O, 
lOmM Tris, ImMEDTA, 150mM NaCl) and treated with 100 mM 2-mercaptoethanol to 
cleave the cross-linker. Cell ly sates were analyzed by SDS-PAGE and autoradiography. 

Re-presentation assays. Re-presentation assays were carried out as described (Suto 

15 and Srivastava, 1995, Science 269:1585-1588). Antigen presenting cells (RAW264.7 
macrophage cell line) were plated at a 1 : 1 ratio with AH I -specific T cells in complete 
RPMI. Approximately 10,000 cells of each type were used. Gp96 (10 ng/ml) chaperoning 
the AH 1-20 mer peptide fRVTYH SPSYVYHOF ERRAIQ was added to the cells and the 
entire culture was incubated for 20 hrs. Stimulation of T cells was measured by quantifying 

20 the amount of IFN-y released into the supernatants by ELISA (Endogen). 

Protein Micros equencing. Proteins identified by affinity chromatography were 
analyzed on SDS-PAGE and stained with coomasie blue or transferred onto PVDF 
membrane and stained with coomasie blue, all of it under keratin-free conditions. Protein 
bands were excised with a razor from the gel or membrane. Tryptic peptides from an 80kDa 

25 faint coomassie band were extracted by 50% acetonitrile, 5% formic acid, dried, and loaded 
onto a 75 m 10 cm, reverse-phase CI 8, microcapillary column (3 ul vol) and tryptic peptides 
were separated by on-line microcapillary liquid chromatography-tendem mass spectrometry 
followed by database searching using the SEQUEST program as previously described. 
(Gatlin et al.,2000, Anal. Chem. 72:757-63; Link et al, 1999, Nat. Biotechnol. 17:676-82). 

30 The analysis was carried out in a data-dependent auto-MS/MS fashion using a Finnigan LCQ 
iontrap Mass Spectrometer. 

6.3 RESULTS 

Identification of an 80 kDa protein as a potential gp96 receptor. Homogenous 
35 preparations of gp96 were coupled to FITC and the gp96-FITC was used to stain RAW264.7 
cells, shown to be functionally capable of re-presenting gp96-chaperoned peptides. Gp96- 
FITC but not control albumin-FITC preparations stained the cell surface of RAW264.7 cells 
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(FIG. 1 A). Plasma membrane preparations of cell surface-biotinylated RAW264.7 cells were 
solubilized in 0.08% octyl-glucoside and the soluble extract was applied to a gp96-Sepharose 
column. The bound proteins were eluted with 3M sodium chloride. SDS-PAGE analysis of 
the eluate showed 2 major bands of -75-80 kDa size (FIG. IB, top left). Blotting of this gel 

5 with avidin-peroxidase showed that both bands were biotinylated, indicating their surface 
localization (FIG. IB, bottom left). Affinity purification of membrane extracts of RAW264.7 
cells over control serum albumin affinity columns did not result in isolation of any proteins, 
nor did probing of immunoblots of such gels with avidin peroxidase detect any albumin- 
binding surface proteins (FIG. IB, top and bottom center lanes). As an additional control, 

10 chromatography of membrane extracts of P815 cells which do not bind gp96-FITC and 

which do not re-present gp96-chaperoned peptides, on gp96 affinity columns did not result in 
elution of any gp96-binding proteins (FIG. IB, top and bottom right lanes). 

In parallel experiments, gp96 molecules were coupled to the radio-iodinated linker 
sulfosuccinimidyl (4-azidosalicylamido) hexanoate (SASD) which contains a photo cross- 
ly 5 linkable group. Gp96-SASD-I 125 was pulsed onto peritoneal macrophages, which have been 
shown previously to re-present gp96-chaperoned peptides (Suto and Srivastava, 1995, 
Science 269: 1585-1588). Excess gp96-SASD was removed by multiple rounds of washing 
of the cells and photoactivation was carried out by exposure of cells to UV light for 10 mm. 
Cell lysates were reduced in order to transfer the I 125 group to the putative gp96 ligand and 

20 were analyzed by SDS- PAGE followed by autoradiography. The gp96 molecule was 

observed to cross-link to an ~80 kDa band specifically present in re-presentation-competent 
macrophage but not in the re-presentation-incompetent P815 cells (FIG. 1C). This band 
appears to correspond in size to the larger of the two bands seen in eluates of gp96 affinity 
columns (FIG. ID). No band corresponding to the lower band in that preparation is seen in 

25 the photo cross-linked preparation. The observation of a specific binding of gp96 to an 80 
kDa protein in two different re-presentation-competent cell types, but not in a re- 
presentation-incompetent cell line, and by two independent assays supported the candidacy 
of the 80 kDa molecule for the gp96 receptor. 

Antiserum against the 80 kDa protein inhibits re-presentation of a gp96-chaperoned 

30 antigenic peptide. The eluates containing the 75-80 kDa proteins were used to immunize a 
New Zealand white rabbit, and pre-immune and immune sera were used to probe blots of 
plasma membrane extracts of the re-presentation-competent RAW264.7 and primary 
peritoneal macrophages and the re-presentation-incompetent P815 cells. The immune but not 
the pre-immune serum detected the 80 kDa band (and a faint lower 75 kDa band) in plasma 

3 5 membrane extracts of primary macrophage and the RAW264. 7 membranes but not of P 8 1 5 
cells (FIG. 2A). The pre-immune and immune sera were tested in a functional assay for their 
ability to block re-presentation of gp96-chaperoned peptides. The L d -restricted epitope AH1 
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derived from the gp70 antigen of murine colon carcinoma CT26 (Huang et al, 1996, Proc. 
Natl. Acad. Sci. U.S.A. 93:9730-9735) was used as the model system. Complexes of gp96 
with an AH1 precursor (used to inhibit direct presentation) were pulsed onto RAW264.7 
cells which were used to stimulate a L d /AH1 -specific CD8+ T cell clone. Release of 

5 interferon- y by the clones was measured as a marker of their activation. RAW264.7 cells 
were able to re-present gp96-chaperoned AH1 precursor effectively in this assay. It was 
observed that at the highest concentration, the immune sera inhibited re-presentation 
completely (FIG. 2B). Although the pre-immune serum was ineffective in inhibiting re- 
presentation as compared to the immune sera, it did inhibit re-presentation significantly at 

10 higher concentrations. The significance of this observation became clear later when we 
determined the identity of the gp96 receptor. Repeated immunizations with the affinity- 
purified gp96-binding proteins did not result in corresponding increase in antibody titers. 

Identification of the 80 kDa protein as an amino terminal fragment of the heavy chain 
of the a2M receptor. The 80 kDa protein eluted from the gp96 affinity column was resolved 

1$ on SDS-PAGE and visualized by staining with Coomassie Brilliant Blue. The protein band 
was subjected to in-gel trypsin digestion and mass spectrometry-based protein 
microsequencing as described in the methods in Section 6.2. Four independent tryptic 
peptides corresponding to N-terminal region of the a 2-macroglobulin (cc2M) receptor, 
designated by immunologists as CD91, were identified (FIG. 3C). 

20 &2M inhibits re-presentation of a gp96-chaperoned antigenic peptide by RA W264. 7. 

oc2M receptor is one of the known natural ligands for the a2M receptor. Its ability to inhibit 
re-presentation of gp96-chaperoned antigenic peptide AH1 was tested in the assay described 
in FIG. 2. a2M but not control proteins selectin (CD62) or serum albumin was observed to 
inhibit re-presentation completely and titratably (FIG. 4). This observation was also 

25 consistent with the result in FIG. 2 that while the pre-immune serum did not detect an 80 
kDa band in plasma membranes of RAW264.7 cells, it did inhibit re-presentation to some 
degree at high concentrations. Thus, by structural as well as functional criteria, the a2M 
receptor was determined to fulfill the criteria essential for a receptor for gp96. 

30 6.4 DISCUSSION 

The a2M receptor, which is also designated CD91, was initially identified as a 

protein related to the low density lipoprotein (LDL) receptor Related Protein (LRP) 

(Strickland et al, 1990, J. Biol. Chem.265:17401-17404; Kristensen et al, 1990, FEBS Lett. 

276:151-155). The protein consists of an -420 kDa a subunit, an 85 kDa p subunit and a 39 
35 kDa tightly associated molecule (RAP). The a and p subunits are encoded by a single 

transcript of -15 Kb in size (Van Leuven et al, 1993, Biochim. Biophys. Acta. 1173:71-74. 

The receptor has been shown to be present in cells of the monocytic lineage and in 
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hepatocytes, fibroblasts and keratinocytes. CD91 has been shown previously to bind the 
activated form of the plasma glycoprotein oc2M, which binds to and inhibits a wide variety of 
endoproteinases. a2M receptor also binds to other ligands such as transforming growth 
factor p (O'Connor-McCourt et al., 1987, J. Biol. Chem. 262:14090-14099), platelet-derived 

5 growth factor (Huang et al. , 1 984, Proc. Natl. Acad. Sci. U.S.A. 8 1 :342-346), and fibroblast 
growth factor (Dennis et al., 1989, J. Biol. Chem. 264:7210-7216). a2M is thus believed to 
regulate, and specifically diminish, the activities of its various ligands. Complexed with 
these various ligands, a2M binds a2M receptor on the cell surface and is internalized 
through receptor-mediated endocytosis. Uptake of cc2M-complexed ligands has been 

IQ assumed thus far to be the primary function of the a2M receptor, although a role for it in 
lipid metabolism is also assumed. a2M receptor ligands other than aZM, such as tissue- 
specific plasminogen activator-inhibitor complex (Orth et al, 1992, Proc. Natl. Acad. Sci. 
U.S.A. 89:7422-7426) and urokinase-PAIl complex (Nykjaer et al, 1992, J. Biol. Chem. 
267: 14543-14546), have been identified. These ligands attest to a role for a2M receptor in 

1 5 clearing a range of extracellular, plasma products. 

The studies reported here show that the heat shock protein gp96 is an additional 
ligand for the cc2M receptor. The human gp96-coding gene has been mapped previously by 
us at chromosome 12 (q24.2^q24.3) (Maki et al., 1993, Somatic Cell Mol. Gen. 19:73-81). 
It is of interest in this regard that the a2M receptor gene has been mapped to the same 

20 chromosome and at a not too distant location (ql3->ql4) (Hilliker et al. Genomics 13:472- 
474). Gp96 binds a2M receptor directly and not through other ligands such as a2M. 
Homogenous preparations of gp96, in solution, or cross-linked to a solid matrix, bind to the 
a2M receptor. Indeed, the major ligand for the a2M receptor, a2M, actually inhibits 
interaction of gp96 with cc2M receptor, instead of promoting it, providing evidence that gp96 

25 is a direct ligand for the a2M receptor. The 80 kDa protein shown to bind gp96 is clearly an 
amino terminal degradation product of the a subunit of the a2M receptor. Degradation 
products of the a2M receptor in this size range have also been observed in previous studies 
(Jensen et al, 1989, Biochem. Arch. 5:171-176), and may indicate the existence of a discrete 
ectodomain in the a2M receptor which may be particularly sensitive to proteolytic cleavage. 

30 As shown here, the gp96- a2M receptor interaction provides a new type of function 

for cc2M receptor, a function of a sensor, not only of the extracellular environment with its 
previously known plasma-based ligands, but also a sensor of the intracellular milieu as well. 
HSPs such as gp96 are obligate intracellular molecules and are released into the extracellular 
milieu only under conditions of necrotic (but not apoptotic) cell death. Thus, the <x2M 

35 receptor may act as a sensor for necrotic cell death (see FIG. 5), just as the scavenger 

receptor CD36 and the recently identified phosphatidyl serine-binding protein act as sensors 
of apoptotic cell death and receptors for apoptotic cells (Savill et al, 1992, J. Clin. 
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Invest.90: 1 513-1 522; Fadok et al. , 2000, Nature 405 :85-90). Interaction of the macrophages 
with the apoptotic cells leads to a down-regulation of the inflammatory cytokines such as 
TNF (Fadok et al. , 2000, supra), while gp96-APC interaction leads to re-presentation of 
gp96-chaperoned peptides by MHC I molecules of the APC, followed by stimulation of 

5 antigen-specific T cells (Suto and Srivastava, 1995, supra) and, in addition, secretion of pro- 
inflammatory cytokines such as TNF, GM-CSF and IL-12. Interestingly, ct2M, an 
independent ligand for the a2M receptor, inhibits representation of gp96-chaperoned 
peptides by macrophages. This observation suggests that re-presentation of gp96-chaperoned 
peptides can not occur physiologically in blood, but only within tissues as a result of 

10 localized necrotic cell death. This is consistent with the complete absence of gp96 or other 
HSPs in blood under all conditions tested. Together, these observations point towards a 
possible mechanism whereby the release of HSPs in the blood as a result of severe tissue 
injury and lysis will not lead to a systemic and lethal pro-inflammatory cytokine cascade. 
It is possible, therefore, that the cc2M receptor renders it possible for the APCs to 

1 5 sample (i) the extracellular milieu of the blood through a2M and other plasma ligands and 
(ii) the intracellular milieu of the tissues through HSPs, particularly of the gp96 family. The 
former permits APCs to implement their primordial phagocytic function, while the latter 
allows them to execute its innate and adaptive immunological functions. Viewed in another 
perspective, recognition of apoptotic cells by APCs through CD36 or phophatidyl serine, 

20 leads to anti-inflammatory signals, while interaction of the APC with necrotic cells through 
a2M receptor leads to pro-inflammatory innate and adaptive immune responses (see 
Srivastava et al., 1998, Immunity 8: 657-665). 

The invention is not to be limited in scope by the specific embodiments described 
25 which are intended as single illustrations of individual aspects of the invention, and 
functionally equivalent methods and components are within the scope of the invention. 
Indeed various modifications of the invention, in addition to those shown and described 
herein will become apparent to those skilled in the art from the foregoing description and 
accompanying drawings. Such modifications are intended to fall within the scope of the 
30 appended claims. 

All references cited herein, including patent applications, patents, and other 
publications, are incorporated by reference herein in their entireties for all purposes. 



35 
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WHAT IS CLAIMED IS : 

1 . A method for identifying a compound that modulates an HSP-a2M 
receptor-mediated process, comprising: 

5 (a) contacting a test compound with a heat shock protein and an alpha (2) 

macroglobulin receptor; and 

(b) measuring the level of alpha (2) macroglobulin receptor activity or 
expression, 

such that if the level of activity or expression measured in (b) differs from the level of alpha 
I o (2) macroglobulin receptor activity in the absence of the test compound, then a compound 
that modulates an HSP-a2M receptor-mediated process is identified. 

2. The method of Claim 1, in which the compound identified is an antagonist 
which interferes with the interaction of the heat shock protein with the alpha (2) 

15 macroglobulin receptor, further comprising the step of: 

(c) determining whether the level interferes with the interaction of the heat shock 
protein and the alpha (2) macroglobulin receptor. 

3. The method of Claim 1, in which the test compound is an antibody specific 
20 for the alpha (2) macroglobulin receptor. 

4. The method of Claim 1, in which the test compound is an antibody is specific 
for alpha (2) macroglobulin. 

25 5. The method of Claim 1, in which the test compound is an antibody is specific 

for a heat shock protein. 

6. The method of Claim 1 , in which the test compound is a small molecule. 

30 7. The method of Claim 1 , in which the test compound is a peptide. 

8. The method of Claim 7, in which the peptide comprises at least 5 consecutive 
amino acids of the alpha (2) macroglobulin receptor (SEQ ID NO.: 7). 

35 9. The method of Claim 7, in which the peptide comprises at least 5 consecutive 

amino acids of alpha (2) macroglobulin (SEQ ID NO.: 4). 



-76- 



NY2- 1119469.2 



10. The method of Claim 7, in which the peptide comprises at least 5 consecutive 
amino acids of a heat shock protein sequence. 

1 1 . The method of Claim 1 , in which the compound is an agonist which enhances 
5 the interaction of the heat shock protein with the alpha (2) macroglobulin receptor. 

12. The method of Claim 1 in which the HSP-cc2M receptor-mediated process 
affects an autoimmune disorder, a disease or disorder involving disruption of antigen 
presentation or endocytosis, a disease or disorder involving cytokine clearance or 

10 inflammation, a proliferative disorder, a viral disorder or other infectious disease, 
hypercholesterolemia, Alzheimer's disease, diabetes, or osteoporosis. 

13. A method for identifying a compound that modulates an HSP-a2M 
receptor-mediated process, comprising: 

j 5 (a) contacting a test compound with a heat shock protein and an alpha (2) 

macroglobulin receptor-expressing cell; and 
(b) measuring the level of alpha (2) macroglobulin receptor activity or expression 
in the cell, 

such that if the level of activity or expression measured in (b) differs from the level of alpha 
20 (2) macroglobulin receptor activity in the absence of the test compound, then a compound 
that modulates an HSP-a2M receptor-mediated process is identified. 

14. The method of Claim 1 or 13 wherein the alpha (2) macroglobulin receptor 
activity measured is the ability to interact with a heat shock protein. 

25 

15. The method of Claim 13 wherein the heat shock protein is non-covalently 
associated with an antigenic peptide and the alpha (2) macroglobulin receptor activity 
measured is the ability to re-present the antigenic peptide. 

30 16. The method of Claim 1 3 wherein the heat shock protein is non-covalently 

associated with an antigenic peptide and the alpha (2) macroglobulin receptor activity 
measured is the ability to stimulate a cytotoxic T cell response against the antigenic peptide. 

17. A method for identifying a compound that modulates the binding of a heat 
35 shock protein to the a2M receptor, comprising: 
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(a) contacting a heat shock protein with an alpha (2) macroglobulin receptor, or 
fragment, or analog, derivative or mimetic thereof, in the presence of a test 
compound; and 

(b) measuring the amount of heat shock protein bound to the alpha (2) 
macroglobulin receptor, or fragment, analog, derivative or mimetic thereof, 

such that if the amount of bound heat shock protein measured in (b) differs from the amount 
of bound heat shock protein measured in the absence of the test compound, then a compound 
that modulates the binding of an HSP to the a2M receptor is identified. 

18. The method of Claim 17 or 67, in which the alpha (2) macroglobulin receptor 
contacted in step (a) is on a cell surface. 



19. The method of Claim 17 or 67, wherein the alpha (2) macroglobulin receptor 
is immobilized to a solid surface. 

20. The method of Claim 1 9 wherein the solid surface is a microtiter dish. 



2 1 . The method of Claim 1 7 wherein the amount of bound heat shock protein is 
measured by contacting the cell with a heat shock protein-specific antibody. 

20 

22. The method of Claim 1 7 wherein the heat shock protein is labeled and the 
amount of bound heat shock protein is measured by detecting the label. 

23 . The method of Claim 22 wherein the heat shock protein is labeled with a 
25 fluorescent label. 



24. A method for identifying a compound that modulates heat shock protein- 
mediated antigen presentation by alpha (2) macroglobulin receptor-expressing cells 
comprising: 

(a) adding a test compound to a mixture of alpha (2) macroglobulin receptor- 
expressing cells and a complex consisting essentially of a heat shock protein 
noncovalently associated with an antigenic molecule, under conditions 
conducive to alpha (2) macroglobulin receptor-mediated endocytosis; 

(b) measuring the level of stimulation of antigen-specific cytotoxic T cells by the 
alpha (2) macroglobulin receptor-expressing cells, 
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such that if the level measured in (b) differs from the level of said stimulation in the absence 
of the test compound, then a compound that modulates heat shock protein-mediated antigen 
presentation by alpha (2) macroglobulin receptor-expressing cells is identified. 

5 25. The method of Claim 24 or 70, in which the measuring stimulation of antigen- 

specific cytotoxic T cells by the a2M receptor-expressing cells of step (b) comprises: 

(i) adding the alpha (2) macroglobulin receptor-expressing cells formed 
in step (a) to T cells under conditions conducive to the activation of 
the T cells; and 

IQ (ii) comparing the level of activation of said cytotoxic T cells with the 

level of activation of T cells by an alpha (2) macroglobulin receptor- 
expressing cell formed in the absence of the test compound, 
wherein an increase of decrease in level of T cell activation indicates that a compound that 
modulates heat shock protein-mediated antigen presentation by alpha (2) macroglobulin 
j 5 receptor-expressing cells is identified. 

26. The method of Claim 1 , 1 8, or 24 in which the heat shock protein is gp96. 



27. A method for detecting a heat shock protein-alpha (2) macroglobulin receptor- 
20 related disorder in a mammal comprising measuring the level of activity from an HSP-alpha 
(2) macroglobulin receptor-mediated process in a patient sample, such that if the measured 
level differs from the level found in clinically normal individuals, then a heat shock protein- 
alpha (2) macroglobulin receptor-related disorder is detected. 



25 28. The method of Claim 27 comprising contacting a sample derived from a 

patient with an antibody specific for the alpha (2) macroglobulin receptor under conditions 
such that immunospecific binding by the antibody. 

29. The method of Claim 27 comprising contacting a sample derived from a 
30 patient with an antibody specific for a heat shock protein under conditions such that 

immunospecific binding by the antibody. 

30. The method of Claim 27 comprising contacting a sample derived from a 
patient with an antibody specific for an HSP-ct2M complex under conditions such that 

35 immunospecific binding by the antibody. 
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31. A method for modulating an immune response comprising administering to a 
mammal a purified compound that modulates the interaction of a heat shock protein with the 
alpha (2) macroglobulin receptor. 



5 32. The method of Claim 31, in which the compound is an agonist which 

enhances the interaction of the heat shock protein and the alpha (2) macroglobulin receptor. 

33. A method for treating an autoimmune disorder comprising administering to a 
mammal in need of such treatment a purified compound that interferes with the interaction of 

1 q a heat shock protein with the alpha (2) macroglobulin receptor. 

34. The method of Claim 33 in which the compound is an antagonist that 
interferes with the interaction between the heat shock protein and the oc2M receptor. 

35. The method of Claim 34, in which the antagonist is an antibody specific for 
alpha (2) macroglobulin receptor. 

36. The method of Claim 34, in which the antagonist is an antibody specific for a 
heat shock protein. 

20 

37. The method of Claim 34, in which the antagonist is a small molecule. 

38. The method of Claim 34, in which the antagonist is a peptide. 

25 39. The method of Claim 34, in which the peptide comprises at least 5 

consecutive amino acids of alpha (2) macroglobulin receptor (SEQ ID NO.:l). 

40. The method of Claim 34, in which the peptide comprises at least 5 
consecutive amino acids of alpha (2) macroglobulin (SEQ ID NO.: 3). 

30 

41 . The method of Claim 34, in which the peptide comprises at least 5 
consecutive amino acids of a heat shock protein sequence. 

42. A method for treating an autoimmune disorder comprising administering to a 
35 mammal in need of such treatment a recombinant cell that expresses an alpha (2) 

macroglobulin receptor which decreases the uptake of a heat shock protein by a functional 
alpha (2) macroglobulin receptor. 
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43. A method for increasing the immunopotency of a cancer cell or an infected 
cell comprising transforming said cell with a nucleic acid comprising a nucleotide sequence 
that (i) is operably linked to a promoter, and (ii) encodes an alpha (2) macroglobulin 
receptor polypeptide. 

44. A method for increasing the immunopotency of a cancer cell or an infected 
cell comprising: 

(a) transforming said cell with a nucleic acid comprising a nucleotide sequence 
that (i) is operably linked to a promoter, and (ii) encodes an alpha (2) 
macroglobulin receptor polypeptide, and 

(b) administering said cell to an individual in need of treatment, 
so as to obtain an elevated immune response. 



45. A recombinant cancer cell transformed with a nucleic acid comprising a 

j5 nucleotide sequence that (i) is operably linked to a promoter, and (ii) encodes an alpha (2) 
macroglobulin receptor polypeptide. 

46. A recombinant infected cell transformed with a nucleic acid comprising a 
nucleotide sequence that (i) is operably linked to a promoter, and (ii) encodes an alpha (2) 

20 macroglobulin receptor polypeptide. 

47. The recombinant cell of Claim 45 or 46 which is a human cell. 

48. A kit, comprising in one or more containers: (a) an anti-a2M receptor 

25 antibody or a nucleic acid probe capable of hybridizing to an a2M receptor nucleic acid, (b) a 
purified heat shock protein, nucleic acid encoding a heat shock protein, or cell expressing a 
heat shock protein; and (c) instructions for use in detecting a heat shock protein-alpha (2) 
macroglobulin receptor-related disorder. 

30 49. The kit of Claim 48 wherein the antibody or nucleic acid probe is labeled with 

a detectable marker. 



50. The kit of Claim 48 further comprising a labeled macroglobulin receptor 
polypeptide. 

35 

51. A kit, in one or more containers, comprising: (a) a purified heat shock 
protein, nucleic acid encoding a heat shock protein, or cell expressing a heat shock protein; 
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and (b) an alpha (2) macroglobulin receptor polypeptide, nucleic acid encoding an alpha (2) 
macroglobulin receptor polypeptide, or cell expressing an alpha (2) macroglobulin receptor 
polypeptide. 

5 52. The kit of Claim 5 1 in which the alpha (2) macroglobulin receptor 

polypeptide, nucleic acid encoding an alpha (2) macroglobulin receptor polypeptide, or cell 
expressing an alpha (2) macroglobulin receptor polypeptide is purified. 

53. The kit of Claim 5 1 further comprising instructions for use in treating an 
10 autoimmune disorder, an infectious disease, or a proliferative disorder. 

54. A method for identifying an oc2M receptor fragment capable of binding a heat 
shock protein, said method comprising: 

(a) contacting a heat shock protein, or peptide-binding fragment thereof, with one 
15 or more alpha (2) macroglobulin receptor fragments; and 

(b) identifying an a2M receptor fragment which specifically binds to the heat 
shock protein, or peptide-binding fragment thereof. 

55. A method for identifying an cc2M receptor fragment capable of inducing an 
20 HSP-a2M receptor-mediated process, said method comprising: 

(a) contacting a heat shock protein with a cell expressing cc2M receptor fragment; 
and 

(b) measuring the level of alpha (2) macroglobulin receptor activity in the cell, 
such that if the level of the HSP-cc2M receptor-mediated process or activity measured in (b) 

25 is greater than the level of alpha (2) macroglobulin receptor activity in the absence of the 
a2M receptor fragment, then an a2M receptor fragment capable of inducing an HSP-a2M 
receptor-mediated process is identified. 

56. The method of Claim 55 wherein the alpha (2) macroglobulin receptor 
30 activity measured is the ability to interact with the heat shock protein. 

57. The method of Claim 55 wherein the heat shock protein is non-covalently 
associated with an antigenic peptide and the alpha (2) macroglobulin receptor activity 
measured is the ability to re-present the antigenic peptide. 

35 
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58. The method of Claim 55 wherein the heat shock protein is non-covalently 
associated with an antigenic peptide and the alpha (2) microglobulin receptor activity 
measured is the ability to stimulate a cytotoxic T cell response against the antigenic peptide. 

5 59. A method for identifying a heat shock protein fragment capable of binding an 

a2M receptor, said method comprising: 

(a) contacting an a2M receptor with one or more heat shock protein fragments; 
and 

(b) identifying a heat shock protein fragment which specifically binds to the a2M 
10 receptor. 

60. A method for identifying a heat shock protein fragment capable of inducing 
an HSP-a2M receptor-mediated process, said method comprising: 

(a) contacting an a2M receptor fragment with a cell expressing a heat shock 
15 protein; and 

(b) measuring the level of alpha (2) macroglobulin receptor activity in the cell, 
such that if the level of the HSP-ct2M receptor-mediated process or activity measured in (b) 
is greater than the level of alpha (2) macroglobulin receptor activity in the absence of said 
heat shock protein fragment, then a heat shock protein fragment capable of inducing an HSP- 

20 a2M receptor-mediated process is identified. 

6 1 . The method of Claim 60 wherein the alpha (2) macroglobulin receptor 
activity measured is the ability to interact with the heat shock protein fragment. 

25 62. The method of Claim 60 wherein the heat shock protein fragment is non- 

covalently associated with an antigenic peptide and the alpha (2) macroglobulin receptor 
activity measured is the ability to re-present the antigenic peptide. 

63. The method of Claim 60 wherein the heat shock protein fragment is non- 
30 covalently associated with an antigenic peptide and the alpha (2) macroglobulin receptor 

activity measured is the ability to stimulate a cytotoxic T cell response against the antigenic 
peptide. 

64. A method for identifying a molecule that binds specifically to an a2M 
35 receptor, said method comprising: 

(a) contacting an a2M receptor with one or more test molecules under conditions 
conducive to binding; and 
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(b) identifying one or more test molecules that specifically bind to the a2M 
receptor. 



65. The method of Claim 64 wherein said test molecules are potential 
5 immunotherapeutic drugs. 

66. A method for screening for molecules that specifically bind to an oc2M 
receptor comprising: 

(a) contacting an a2M receptor with one or more test molecules under conditions 
10 conducive to binding; and 

(b) determining whether any of said test molecules specifically bind to the a2M 
receptor. 

67. A method for identifying a compound that modulates the binding of an a2M 
15 receptor ligand to the a2M receptor comprising: 

(a) contacting an cc2M receptor with an a2M receptor ligand, or an a2M receptor- 
binding fragment, analog, derivative or mimetic thereof, in the presence of 
one or more test compounds; and 

(b) measuring the amount of a2M receptor ligand, or fragment, analog, derivative 
20 or mimetic thereof, bound to the a2M receptor, 

such that if the amount of bound a2M receptor ligand measured in (b) differs from the 
amount of bound a2M receptor measured in the absence of the test compound, then a 
compound that modulates the binding of an a2M receptor ligand to the oc2M receptor is 
identified. 

25 

68. A method for identifying a compound that modulates the interaction between 
the oc2M receptor and an a2M receptor ligand, comprising: 

(a) contacting an oc2M receptor with one or more test compounds; and 

(b) measuring the level of a2M receptor activity or expression, 

30 such that if the level of activity or expression measured in (b) differs from the level of a2M 
receptor activity in the absence of one or more test compounds, then a compound that 
modulates the interaction between the cc2M receptor and an cc2M receptor ligand is 
identified. 

35 69. The method of Claim 67 or 68 wherein the a2M receptor ligand is a2 

macroglobulin. 
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70. A method for identifying a compound that modulates antigen presentation by 
a2M receptor-expressing cells comprising: 

(a) adding one or more test compounds to a mixture of a2M receptor-expressing 
cells and a complex comprising an a2M receptor ligand and an antigenic 

5 molecule, under conditions conducive to a2M receptor-mediated endocytosis; 

(b) measuring the level of stimulation of antigen-specific cytotoxic T cells by the 
a2M receptor-expressing cells, 

such that if the level measured in (b) differs from the level of said stimulation in the absence 
of the one or more test compounds, then a compound that modulates antigen presentation by 
IQ oc2M receptor-expressing cells is identified. 

71. A method for modulating an immune response comprising administering to a 
mammal a purified compound that binds to the a2M receptor, in an amount effective to 
modulate an immune response in the mammal. 

15 

72. A method for treating or preventing a disease or disorder comprising 
administering to a mammal a purified compound that binds to the a2M receptor, in an 
amount effective to treat or prevent the disease or disorder in the mammal. 

2Q 73 . The method of Claim 72 wherein the disease or disorder is cancer or an 

infectious disease. 

74. A method for treating an autoimmune disorder comprising administering to a 
mammal in need of such treatment a purified compound that binds to the cc2M receptor, in an 
25 amount effective to treat an autoimmune disorder in the mammal. 



30 
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ABSTRACT 



The present invention relates to the use of alpha (2) macroglobulin ("a2M") receptor 
as a heat shock protein receptor, cells that express the oc2M receptor bound to an HSP, and 
5 antibodies and other molecules that bind the a2M receptor-HSP complex. The invention 
also relates to screening assays to identify compounds that interact with the cc2M receptor, 
and modulate the interaction of the a2M receptor with its ligand, such as HSPs, and methods 
for using compositions comprising a2M-receptor sequences for the diagnosis and treatment 
of immune disorders, proliferative disorders, and infectious diseases. 
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CGCTGCTCCC CGCCAGTGCA CTGAGGAGGC GGAAACGGGG GAGCCCCTAG TGCTCCATCA 60 

GGCCCCTACC AAGGCACCCC CATCGGGTCC ACGCCCCCCA CCCCCCACCC CGCCTCCTCC 120 

CAATTGTGCA TTTTTGCAGC CGGAGTCGGC TCCGAGATGG GGCTGTGAGC TTCGCCCTGG 180 

GAGGGGGAGA GGAGCGAGGA GTAAAGCAGG GGTGAAGGGT TCGAATTTGG GGGCAGGGGG 24 0 

CGCACCCGCG TCAGCAGGCC CTTCCCAGGG GGCTCGGAAC TGTACCATTT CACCTATGCC 300 

CCTGGTTCGC TTTGCTTAAG GAAGGATAAG ATAGAAGAGT CGGGGAGAGG AAGATAAAGG 360 

GGGACCCCCC AATTGGGGGG GGCGAGGACA AGAAGTAACA GGACCAGAGG GTGGGGGCTG 420 

CTGTTTGCAT CGGCCCACAC C ATG CTG ACC CCG CCG TTG CTG CTG CTC GTG 4 71 

Met Leu Thr Pro Pro Leu Leu Leu Leu Val 

15 10 

CCG CTG CTT TCA GCT CTG GTC TCC GGG GCC ACT ATG GAT GCC CCT AAA 519 

Pro Leu Leu Ser Ala Leu Val Ser Gly Ala Thr Met Asp Ala Pro Lys 

15 20 25 

ACT TGC AGC CCT AAG CAG TTT GCC TGC AGA GAC CAA ATC ACC TGT ATC 567 

Thr Cys Ser Pro Lys Gin Phe Ala Cys Arg Asp Gin lie Thr Cys lie 

30 35 ' 40 

TCA AAG GGC TGG CGG TGT GAC GGT GAA AGA GAT TGC CCC GAC GGC TCT 615 

Ser Lys Gly Trp Arg Cys Asp Gly Glu Arg Asp Cys Pro Asp Gly Ser 

45 50 55 



GAT GAA GCC CCT GAG ATC TGT CCA CAG AGT AAA GCC CAG AGA TGC CCG 663 

Asp Glu Ala Pro Glu lie Cys Pro Gin Ser Lys Ala Gin Arg Cys Pro 

60 65 70 

CCA AAT GAG CAC AGT TGT CTG GGG ACT GAG CTA TGT GTC CCC ATG TCT 711 

Pro Asn Glu His Ser Cys Leu Gly Thr Glu Leu Cys Val Pro Met Ser 

75 80 85 90 

CGT CTC TGC AAC GGG ATC CAG GAC TGC ATG GAT GGC TCA GAC GAG GGT 75 9 

Arg Leu Cys Asn Gly lie Gin Asp Cys Met Asp Gly Ser Asp Glu Gly 

95 100 105 

GCT CAC TGC CGA GAG CTC CGA GCC AAC TGT TCT CGA ATG GGT TGT CAA 8 07 

Ala His Cys Arg Glu Leu Arg Ala Asn Cys Ser Arg Met Gly Cys Gin 

110 115 120 

CAC CAT TGT GTA CCT ACA CCC AGT GGG CCC ACG TGC TAC TGT AAC AGC 8 55 

His His Cys Val Pro Thr Pro Ser Gly Pro Thr Cys Tyr Cys Asn Ser 

125 130 135 



AGC TTC CAG CTC GAG GCA GAT GGC AAG ACG TGC AAA GAT TTT GAC GAG 903 

Ser Phe Gin Leu Glu Ala Asp Gly Lys Thr Cys Lys Asp Phe Asp Glu 
140 145 150 

TGT TCC GTG TAT GGC ACC TGC AGC CAG CTT TGC ACC AAC ACA GAT GGC 951 

Cys Ser Val Tyr Gly Thr Cys Ser Gin Leu Cys Thr Asn Thr Asp Gly 
155 160 165 170 



TCC TTC ACA TGT GGC TGT GTT GAA GGC TAC CTG CTG CAA CCG GAC AAC 9 99 

Ser Phe Thr Cys Gly Cys Val Glu Gly Tyr Leu Leu Gin Pro Asp Asn 
175 180 - 185 

CGC TCC TGC AAG GCC AAG AAT GAG CCA GTA GAT CGG CCG CCA GTG CTA 1047 
Arg Ser Cys Lys Ala Lys Asn Glu Pro Val Asp Arg Pro Pro Val Leu 
190 195 200 
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CTG ATT GCC AAC TCT CAG AAC ATC CTA GCT ACG TAC CTG AGT GGG GCC 
Leu lie Ala Asn Ser Gin Asn lie Leu Ala Thr Tyr Leu Ser Gly Ala 
205 210 215 

CAA GTG TCT ACC ATC ACA CCC ACC AGC ACC CGA CAA ACC ACG GCC ATG 
Gin Val Ser Thr He Thr Pro Thr Ser Thr Arg Gin Thr Thr Ala Met 
220 225 230 

GAC TTC AGT TAT GCC AAT GAG ACC GTA TGC TGG GTG CAC GTT GGG GAC 
Asp Phe Ser Tyr Ala Asn .Glu Thr Val Cys Trp Val His Val Gly Asp 
235 240 245 250 

AGT GCT GCC CAG ACA CAG CTC AAG TGT GCC CGG ATG CCT GGC CTG AAG 
Ser Ala Ala Gin Thr Gin Leu Lys Cys Ala Arg Met Pro Gly Leu Lys 
255 260 265 

GGC TTT GTG GAT GAG CAT ACC ATC AAC ATC TCC CTC AGC CTG CAC CAC 
Gly Phe Val Asp Glu His Thr He Asn He Ser Leu Ser Leu His His 
270 - 275 280 

GTG GAG CAG ATG GCA ATC GAC TGG CTG ACG GGA AAC TTC TAC TTT GTC 
Val Glu Gin Met Ala He Asp Trp Leu Thr Gly Asn Phe Tyr Phe Val 
285 290 295 

GAC GAC ATT GAC GAC AGG ATC TTT GTC TGT AAC CGA AAC GGG GAC ACC 
Asp Asp He Asp Asp Arg He Phe Val Cys Asn Arg Asn Gly Asp Thr 
300 " 305 310 

TGT GTC ACT CTG CTG GAC CTG GAA CTC TAC A^C CCC AAA GGC ATC GCC 
Cys Val Thr Leu Leu Asp Leu Glu Leu Tyr Asn Pro Lys Gly He Ala 
315 320 325 330 

TTG GAC CCC GCC ATG GGG AAG GTG TTC TTC ACT GAC TAC GGG CAG ATC 
Leu Asp Pro Ala Met Gly Lys Val Phe Phe Thr Asp Tyr Gly Gin He 
335 340 345 

CCA AAG GTG GAG CGC TGT GAC ATG GAT GGA CAG AAC CGC ACC AAG CTG 
Pro Lys Val Glu Arg Cys Asp Met Asp Gly Gin Asn Arg Thr Lys Leu 
350 " ' 355 360 

GTG GAT AGC AAG ATC GTG TTT CCA CAC GGC ATC ACC CTG GAC CTG GTC 
Val Asp Ser Lys He Val Phe Pro His Gly He Thr Leu Asp Leu Val 
365 370 375 

AGC CGC CTC GTC TAC TGG GCG GAC GCC TAC CTA GAC TAC ATC GAG GTG 
Ser Arg Leu Val Tyr Trp Ala Asp Ala Tyr Leu Asp Tyr He Glu Val 
380 " 385 390 

GTA GAC TAC GAA GGG AAG GGT CGG CAG ACC ATC ATC CAA GGC ATC CTG 
Val Asp Tyr Glu Gly Lys Gly Arg Gin Thr He He Gin Gly He Leu 
395 400 405 410 

ATC GAG CAC CTG TAC GGC CTG ACC GTG TTT GAG AAC TAT CTC TAC GCC 
He Glu His Leu Tyr Gly Leu Thr Val Phe Glu Asn Tyr Leu Tyr Ala 
415 420 425 

ACC AAC TCG GAC AAT GCC AAC ACG CAG CAG AAG ACG AGC GTG ATC CGA 
Thr Asn Ser Asp Asn Ala Asn Thr Gin Gin Lys Thr Ser Val He Arg 
430 435 * 440 
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GTG AAC CGG TTC AAC AGT ACT GAG TAC CAG GTC GTC ACC CGT GTG GAC 
Val Asn Arg Phe Asn Ser Thr Glu Tyr Gin Val Val Thr Arg Val Asp 
445 450 455 

AAG GGT GGT GCC CTG CAT ATC TAC CAC CAG CGA CGC CAG CCC CGA GTG 
Lys Gly Gly Ala Leu His lie Tyr His Gin Arg Arg Gin Pro Arg Val 
460 465 470 

CGG AGT CAC GCC TGT GAG AAT GAC CAG TAC GGG AAG CCA GGT GGC TGC 
Arg Ser His Ala Cys Glu Asn Asp Gin Tyr Gly Lys Pro Gly Gly Cys 
475 480 485 490 

TCC GAC ATC TGC CTC CTG GCC AAC AGT CAC AAG GCA AGG ACC TGC AGG 
Ser Asp lie Cys Leu Leu Ala Asn Ser His Lys Ala Arg Thr Cys Arg 
4 95 500 505 

TGC AGG TCT GGC TTC AGC CTG GGA AGT GAT GGG AAG TCT TGT AAG AAA 
Cys Arg Ser Gly Phe Ser Leu Gly Ser Asp Gly Lys Ser Cys Lys Lys 
510 515 520 

CCT GAA CAT GAG CTG TTC CTC GTG TAT GGC AAG GGC CGA CCA GGC ATC 
Pro Glu His Glu Leu Phe Leu Val Tyr Gly Lys Gly Arg Pro Gly lie 
525 530 535 

ATT AGA GGC ATG GAC ATG GGG GCC AAG GTC CCA GAT GAG CAC ATG ATC 
lie Arg Gly Met Asp Met Gly Ala Lys Val Pro Asp Glu His Met lie 
540 " 545 " 550 

CCC ATC GAG AAC CTT ATG AAT CCA CGC GCT CTG GAC TTC CAC GCC GAG 
Pro lie Glu Asn Leu Met Asn Pro Arg Ala Leu Asp Phe His Ala Glu 
555 560 565 570 

ACC GGC TTC ATC TAC TTT GCT GAC ACC ACC AGC TAC CTC ATT GGC CGC 
Thr Gly Phe lie Tyr Phe Ala Asp Thr Thr Ser Tyr Leu He Gly Arg 
575 580 585 

CAG AAA ATT GAT GGC ACG GAG AGA GAG ACT ATC CTG AAG GAT GGC ATC 
Gin Lys He Asp Gly Thr Glu Arg Glu Thr He Leu Lys Asp Gly He 
590 595 600 

CAC AAT GTG GAG GGC GTA GCC GTG GAC TGG ATG GGA GAC AAT CTT TAC 
His Asn Val Glu Gly Val Ala Val Asp Trp Met Gly Asp Asn Leu Tyr 
605 610 615 

TGG ACT GAT GAT GGC CCC AAG AAG ACC ATT AGT GTG GCC AGG CTG GAG 
Trp Thr Asp Asp Gly Pro Lys Lys Thr He Ser Val Ala Arg Leu Glu 
620 625 630 

AAA GCC GCT CAG ACC CGG AAG ACT CTA ATT GAG GGC AAG ATG ACA CAC 
Lys Ala Ala Gin Thr Arg Lys Thr Leu He Glu Gly Lys Met Thr His 
635 640 645 650 

CCC AGG GCC ATT GTA GTG GAT CCA CTC AAT GGG TGG ATG TAC TGG ACA 
Pro Arg Ala He Val Val Asp Pro Leu Asn Gly Trp Met Tyr Trp Thr 
655 " 660 665 

GAC TGG GAG GAG GAC CCC AAG GAC AGT CGG CGA GGG CGG CTC GAG AGG 
Asp Trp Glu Glu Asp Pro Lys Asp Ser Arg Arg Gly Arg Leu Glu Arg 
670 675 680 
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GCT TGG ATG GAC GGC TCA CAC CGA GAT ATC TTT GTC ACC TCC AAG ACA 
Ala Trp Met Asp Gly Ser His Arg Asp He Phe Val Thr Ser Lys Thr 
685 690 695 

GTG CTT TGG CCC AAT GGG CTA AGC CTG GAT ATC CCA GCC GGA CGC CTC 
Val Leu Trp Pro Asn Gly Leu Ser Leu Asp He Pro Ala Gly Arg Leu 
700 705 710 

TAC TGG GTG GAT GCC TTC TAT GAC CGA ATT GAG ACC ATA CTG CTC AAT 
Tyr Trp Val Asp Ala Phe Tyr Asp Arg He Glu Thr He Leu Leu Asn 
715 ' 720 ' 725 730 

GGC ACA GAC CGG AAG ATT GTA TAT GAG GGT CCT GAA CTG AAT CAT GCC 
Gly Thr Asp Arg Lys He Val Tyr Glu Gly Pro Glu Leu Asn His Ala 
735 " 740 745 

TTC GGC CTG TGT CAC CAT GGC AAC TAC CTC TTT TGG ACC GAG TAC CGG 
Phe Gly Leu Cys His His Gly Asn Tyr Leu Phe Trp Thr Glu Tyr Arg 
750 755 760 

AGC GGC AGC GTC TAC CGC TTG GAA CGG GGC GTG GCA GGC GCA CCG CCC 
Ser Gly Ser Val Tyr Arg Leu Glu Arg Gly Val Ala Gly Ala Pro Pro 
765 770 775 

ACT GTG ACC CTT CTG CGC AGC GAG AGA CCG CCT ATC TTT GAG ATC CGA 
Thr Val Thr Leu Leu Arg Ser Glu Arg Pro Pro He Phe Glu He Arg 
780 785 ' 790 

ATG TAC GAC GCG CAC GAG CAG CAA GTG GGT ACC AAC AAA TGC CGG GTA 
Met Tyr Asp Ala His Glu Gin Gin Val Gly Thr Asn Lys Cys Arg Val 
795 800 805 810 

AAT AAC GGA GGC TGC AGC AGC CTG TGC CTC GCC ACC CCC GGG AGC CGC 
Asn Asn Gly Gly Cys Ser Ser Leu Cys Leu Ala Thr Pro Gly Ser Arg 
815 820 825 

CAG TGT GCC TGT GCC GAG GAC CAG GTG TTG GAC ACA GAT GGT GTC ACC 
Gin Cys Ala Cys Ala Glu Asp Gin Val Leu Asp Thr Asp Gly Val Thr 
830 835 840 

TGC TTG GCG AAC CCA TCC TAC GTG CCC CCA CCC CAG TGC CAG CCG GGC 
Cys Leu Ala Asn Pro Ser Tyr Val Pro Pro Pro Gin Cys Gin Pro Gly 
845 850 855 

CAG TTT GCC TGT GCC AAC AAC CGC TGC ATC CAG GAG CGC TGG AAG TGT 
Gin Phe Ala Cys Ala Asn Asn Arg Cys He Gin Glu Arg Trp Lys Cys 
860 865 870 

GAC GGA GAC AAC GAC TGT CTG GAC AAC AGC GAT GAG GCC CCA GCA CTG 
Asp Gly Asp Asn Asp Cys Leu Asp Asn Ser Asp Glu Ala Pro Ala Leu 
875 880 885 890 

TGC CAT CAA CAC ACC TGT CCC TCG GAC CGA TTC AAG TGT GAG AAC AAC 
Cys His Gin His Thr Cys Pro Ser Asp Arg Phe Lys Cys Glu Asn Asn 
895 900 905 

CGG TGT ATC CCC AAC CGC TGG CTC TGT GAT GGG GAT AAT GAT TGT GGC 
Arg Cys He Pro Asn Arg Trp Leu Cys Asp Gly Asp Asn Asp Cys Gly 
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AAC AGC GAG GAC GAA TCC AAT GCC ACG TGC TCA GCC CGC ACC TGT CCA 
Asn Ser Glu Asp Glu Ser Asn Ala Thr Cys Ser Ala Arg Thr Cys Pro 
925 930 935 

CCC AAC CAG TTC TCC TGT GCC AGT GGC CGA TGC ATT CCT ATC TCA TGG 
Pro Asn Gin Phe Ser Cys Ala Ser Gly Arg Cys He Pro He Ser Trp 
940 945 950 

ACC TGT GAT CTG GAT GAT GAC TGT GGG GAC CGG TCC GAT GAG TCA GCC 
Thr Cvs Asp Leu Asp Asp Asp Cys Gly Asp Arg Ser Asp Glu Ser Ala 
955 960 965 970 

TCA TGC GCC TAC CCC ACC TGC TTC CCC CTG ACT CAA TTT ACC TGC AAC 
Ser Cvs Ala Tyr Pro Thr Cys Phe Pro Leu Thr Gin Phe Thr Cys Asn 
975 980 985 

AAT GGC AGA TGT ATT AAC ATC AAC TGG CGG TGT GAC AAC GAC AAT GAC 
Asn Glv Arg Cys He Asn He Asn Trp Arg Cys Asp Asn Asp Asn Asp 
990 995 1000 

TGT GGG GAC AAC AGC GAC GAA GCC GGC TGC AGT CAC TCC TGC TCC AGT 
Cvs Gly Asp Asn Ser Asp Glu Ala Gly Cys Ser His Ser Cys Ser Ser 
1005 1010 1015 

ACC CAG TTC AAG TGC AAC AGT GGC AGA TGC ATC CCC GAG CAC TGG ACG 
Thr Gin Phe Lys Cys Asn Ser Gly Arg Cys He Pro Glu His Trp Thr 
10 20 1025 1030 

TGT GAT GGG GAC AAT GAT TGT GGG GAC TAC AGC GAC GAG AC A CAC GCC 
Cvs Asp Gly Asp Asn Asp Cys Gly Asp Tyr Ser Asp Glu Thr His Ala 
10 35 1040 1045 1050 

AAC TGT ACC AAC CAG GCT AC A AGA CCT CCT GGT GGC TGC CAC TCG GAT 
Asn Cys Thr Asn Gin Ala Thr Arg Pro Pro Gly Gly Cys His Ser Asp 
1055 1060 1065 

GAG TTC CAG TGC CCG CTA GAT GGC CTG TGC ATC CCC CTG AGG TGG CGC 
Glu Phe Gin Cys Pro Leu Asp Gly Leu Cys He Pro Leu Arg Trp Arg 
1070 1075 1080 

TGC GAC GGG GAC ACC GAC TGC ATG GAT TCC AGC GAT GAG AAG AGC TGT 
Cvs Asp Gly Asp Thr Asp Cys Met Asp Ser Ser Asp Glu Lys Ser Cys 
1085 1090 1095 

GAG GGC GTG ACC CAT GTT TGT GAC CCG AAT GTC AAG TTT GGC TGC AAG 
Glu Gly Val Thr His Val Cys Asp Pro Asn Val Lys Phe Gly Cys Lys 
1100 1105 1110 

GAC TCC GCC CGG TGC ATC AGC AAG GCG TGG GTG TGT GAT GGC GAC AGC 
Asp Ser Ala Arg Cys He Ser Lys Ala Trp Val Cys Asp Gly Asp Ser 
1115 1120 H25 H30 

GAC TGT GAA GAT AAC TCC GAC GAG GAG AAC TGT GAG GCC CTG GCC TGC 
Asp Cvs Glu Asp Asn Ser Asp Glu Glu Asn Cys Glu Ala Leu Ala Cys 
1135 1140 1145 

AGG CCA CCC TCC CAT CCC TGC GCC AAC AAC ACC TCT GTC TGC CTG CCT 
Arcr Pro Pro Ser His Pro Cys Ala Asn Asn Thr Ser Val Cys Leu Pro 
1150 1155 1160 
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CCT GAC AAG CTG TGC GAC GGC AAG GAT GAC TGT GGA GAC GGC TCG GAT 
Pro Asp Lys Leu Cys Asp Gly Lys Asp Asp Cys Gly Asp Gly Ser Asp 
1165 1170 1175 

GAG GGC GAG CTC TGT GAC CAG TGT TCT CTG AAT AAT GGT GGC TGT AGT 
Glu Gly Glu Leu Cys Asp Gin Cys Ser Leu Asn Asn Gly Gly Cys Ser 
1180 1185 1190 

CAC AAC TGC TCA GTG GCC CCT GGT GAA GGC ATC GTG TGC TCT TGC CCT 
His Asn Cys Ser Val Ala Pro Gly Glu Gly lie Val Cys Ser Cys Pro 
1195 1200 1205 1210 

CTG GGC ATG GAG CTG GGC TCT GAC AAC CAC ACC TGC CAG ATC CAG AGC 
Leu Gly Met Glu Leu Gly Ser Asp Asn His Thr Cys Gin lie Gin Ser 
1215 1220 1225 

TAC TGT GCC AAG CAC CTC AAA TGC AGC CAG AAG TGT GAC CAG AAC AAG 
Tyr Cys Ala Lys His Leu Lys Cys Ser Gin -Lys Cys Asp Gin Asn Lys 
1230 1235 1240 

TTC AGT GTG AAG TGC TCC TGC TAC GAG GGC TGG GTC TTG GAG CCT GAC 
Phe Ser Val Lys Cys Ser Cys Tyr Glu Gly Trp Val Leu Glu Pro Asp 
1245 1250 1255 

GGG GAA ACG TGC CGC AGT CTG GAT CCC TTC AAA CTG TTC ATC ATC TTC 
Gly Glu Thr Cys Arg Ser Leu Asp Pro Phe Lys Leu Phe lie lie Phe 
1260 1265 1270 

TCC AAC CGC CAC GAG ATC AGG CGC ATT GAC CTT CAC AAG GGG GAC TAC 
Ser Asn Arg His Glu He Arg Arg lie Asp Leu His Lys Gly Asp Tyr 
1275 1280 1285 1290 

AGC GTC CTA GTG CCT GGC CTG CGC AAC ACT ATT GCC CTG GAC TTC CAC 
Ser Val Leu Val Pro Gly Leu Arg Asn Thr He Ala Leu Asp Phe His 
1295 1300 1305 

CTC AGC CAG AGT GCC CTC TAC TGG ACC GAC GCG GTA GAG GAC AAG ATC 
Leu Ser Gin Ser Ala Leu Tyr Trp Thr Asp Ala Val Glu Asp Lys He 
1310 1315 1320 

TAC CGT GGG AAA CTC CTG GAC AAC GGA GCC CTG ACC AGC TTT GAG GTG 
Tyr Arg Gly Lys Leu Leu Asp Asn Gly Ala Leu Thr Ser Phe Glu Val 
1325 1330 1335 

GTG ATT CAG TAT GGC TTG GCC ACA CCA GAG GGC CTG GCT GTA GAT TGG 
Val He Gin Tyr Gly Leu Ala Thr Pro Glu Gly Leu Ala Val Asp Trp 
1340 1345 1350 

ATT GCA GGC AAC ATC TAC TGG GTG GAG AGC AAC CTG GAC CAG ATC GAA 
He Ala Gly Asn He Tyr Trp Val Glu Ser Asn Leu Asp Gin He Glu 
1355 1360 1365 1370 

GTG GCC AAG CTG GAC GGA ACC CTC CGA ACC ACT CTG CTG GCG GGT GAC 
Val Ala Lys Leu Asp Gly Thr Leu Arg Thr Thr Leu Leu Ala Gly Asp 
1375 1380 1385 

ATT GAG CAC CCG AGG GCC ATC GCT CTG GAC CCT CGG GAT GGG ATT CTG 
He Glu His Pro Arg Ala He Ala Leu Asp Pro Arg Asp Gly He Leu 
1390 1395 1400 
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TTT TGG ACA GAC TGG GAT GCC AGC CTG CCA CGA ATC GAG GCT GCA TCC 4 695 
Phe Trp Thr Asp Trp Asp Ala Ser Leu Pro Arg lie Glu Ala Ala Ser 
1405 1410 " 1415 

ATG AGT GGA GCT GGC CGC CGA ACC ATC CAC CGG GAG ACA GGC TCT GGG 47 4 3 
Met Ser Gly Ala Gly Arg Arg Thr lie His Arg Glu Thr Gly Ser Gly 
1420 1425 1430 

GGC TGC GCC AAT GGG CTC ACC GTG GAT TAC CTG GAG AAG CGC ATC CTC 4 791 
Gly Cys Ala Asn Gly Leu Thr Val Asp Tyr Leu Glu Lys Arg He Leu 
1435 1440 * 1445 " 1450 

TGG ATT GAT GCT AGG TCA GAT GCC ATC TAT TCA GCC CGG TAT GAC GGC 4839 
Trp He Asp Ala Arg Ser Asp Ala He Tyr Ser Ala Arg Tyr Asp Gly 
1455 1460 1465 

TCC GGC CAC ATG GAG GTG CTT CGG GGA CAC GAG TTC CTG TCA CAC CCA 4887 
Ser Gly His Met Glu Val Leu Arg Gly His Glu Phe Leu Ser His Pro 
1470 1475 1480 

TTT GCC GTG ACA CTG TAC GGT GGG GAG GTG TAC TGG ACC GAC TGG CGA 4 935 
Phe Ala Val Thr Leu Tyr Gly Gly Glu Val Tyr Trp Thr Asp Trp Arg 
1485 1490 ' 1495 

ACA AAT ACA CTG GCT AAG GCC AAC AAG TGG ACT GGC CAC AAC GTC ACC 4 983 
Thr Asn Thr Leu Ala Lys Ala Asn Lys Trp Thr Gly His Asn Val Thr 
1500 1505 1510 

GTG GTA CAG AGG ACC AAC ACC CAG CCC TTC GAC CTG CAG GTG TAT CAC 5031 
Val Val Gin Arg Thr Asn Thr Gin Pro Phe Asp Leu Gin Val Tyr His 
1515 1520 1525 1530 

CCT TCC CGG CAG CCC ATG GCT CCA AAC CCA TGT GAG GCC AAT GGC GGC 507 9 
Pro Ser Arg Gin Pro Met Ala Pro Asn Pro Cys Glu Ala Asn Gly Gly 
1535 1540 1545 

CGG GGC CCC TGT TCC CAT CTG TGC CTC ATC AAC TAC AAC CGG ACC GTC 5127 • 
Arg Gly Pro Cys Ser His Leu Cys Leu He Asn Tyr Asn Arg Thr Val 
1550 1555 1560 

TCC TGG GCC TGT CCC CAC CTC ATG AAG CTG CAC AAG GAC AAC ACC ACC 5175 
Ser Trp Ala Cys Pro His Leu Met Lys Leu His Lys Asp Asn Thr Thr 
1565 1570 1575 

TGC TAT GAG TTT AAG AAG TTC CTG CTG TAC GCA CGT CAG ATG GAG ATC 5223 
Cys Tyr Glu Phe Lys Lys Phe Leu Leu Tyr Ala Arg Gin Met Glu He 
1580 * 1585 * 1590 

CGG GGC GTG GAC CTG GAT GCC CCG TAC TAC AAT TAT ATC ATC TCC TTC 5271 
Arg Gly Val Asp Leu Asp Ala Pro Tyr Tyr Asn Tyr He lie Ser Phe 
1595 1600 1605 1610 

ACG GTG CCT GAT ATC GAC AAT GTC ACG GTG CTG GAC TAT GAT GCC CGA .5319 
Thr Val Pro Asp He Asp Asn Val Thr Val Leu Asp Tyr- Asp Ala Arg 
1615 1620 1625 

GAG CAG CGA GTT TAC TGG TCT GAT GTG CGG ACT CAA GCC ATC AAA AGG 5367 
Glu Gin Arg Val Tyr Trp Ser Asp Val Arg Thr Gin Ala He Lys Arg 
1630 1635 ' 1640 
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GCA TTT ATC AAC GGC ACT GGC GTG GAG ACC GTT GTC TCT GCA GAC TTG 5 415 

Ala Phe lie Asn Gly Thr Gly Val Glu Thr Val Val Ser Ala Asp Leu 
1645 1650 1655 

CCC AAC GCC CAC GGG CTG GCT GTG GAC TGG GTC TCC CGA AAT CTG TTT 5 4 63 
Pro Asn Ala His Gly Leu Ala Val Asp Trp Val Ser Arg Asn Leu Phe 
1660 1665 1670 

TGG ACA AGT TAC GAC ACC AAC AAG AAG CAG ATT AAC GTG GCC CGG CTG 5511 
Trp Thr Ser Tyr Asp Thr Asn Lys Lys Gin lie Asn Val Ala Arg Leu 
1675 1680 1685 1690 

GAC GGC TCC TTC AAG AAT GCG GTG GTG CAG GGC CTG GAG CAG CCC CAC 555 9 
Asp Gly Ser Phe Lys Asn Ala Val Val Gin Gly Leu Glu Gin Pro His 
1695 1700 1705 

GGC CTG GTC GTC CAC CCG CTT CGT GGC AAG CTC TAC TGG ACT GAT GGG 5607 
Gly Leu Val Val His Pro Leu Arg Gly Lys Leu Tyr Trp Thr Asp Gly 
1710 1715 ~ 1720 

GAC AAC ATC AGC ATG GCC AAC ATG GAT GGG AGC AAC CAC ACT CTG CTC 5 655 
Asp Asn lie Ser Met Ala Asn Met Asp Gly Ser Asn His Thr Leu Leu 
1725 1730 * ' 1735 

TTC AGT GGC CAG AAG GGC CCT GTG GGG TTG GCC ATT GAC TTC CCT GAG 57 03 
Phe Ser Gly Gin Lys Gly Pro Val Gly Leu Ala lie Asp Phe Pro Glu 
1740 ' ~ 1745 1750 

AGC AAA CTC TAC TGG ATC AGC TCT GGG AAC CAC ACA ATC AAC CGT TGC 57 51 

Ser Lys Leu Tyr Trp lie Ser Ser Gly Asn His Thr lie Asn Arg Cys 
1755 ' 1760 1765 1710 

AAT CTG GAT GGG AGC GAG CTG GAG GTC ATC GAC ACC ATG CGG AGC CAG 57 99 
Asn Leu Asp Gly Ser Glu Leu Glu Val lie Asp Thr Met Arg Ser Gin 
1775 1780 1785 

CTG GGC AAG GCC ACT GCC CTG GCC ATC ATG GGG GAC AAG CTG TGG TGG 58 4 7 
Leu Gly Lys Ala Thr Ala Leu Ala lie Met Gly Asp Lys Leu Trp Trp 
1790 1795 1800 

GCA GAT CAG GTG TCA GAG AAG ATG GGC ACG TGC AAC AAA GCC GAT GGC 5895 
Ala Asp Gin Val Ser Glu Lys Met Gly Thr Cys Asn Lys Ala Asp Gly 
1805 1810 1815 

TCT GGG TCC GTG GTG CTG CGG AAC AGT ACC ACG TTG GTT ATG CAC ATG 594 3 
Ser Gly Ser Val Val Leu Arg Asn Ser Thr Thr Leu Val Met His Met 
1820 1825 1830 

AAG GTG TAT GAC GAG AGC ATC CAG CTA GAG CAT GAG GGC ACC AAC CCC 5991 
Lys Val Tyr Asp Glu Ser lie Gin Leu Glu His Glu Gly Thr Asn Pro 
1835 1840 1845 1850 

TGC AGT GTC AAC AAC GGA GAC TGT TCC CAG CTC TGC CTG CCA ACA TCA 6039 
Cys Ser Val Asn Asn Gly Asp Cys Ser Gin Leu Cys Leu Pro Thr Ser 
1855 1860 1865 

GAG ACG ACT CGC TCC TGT ATG TGT ACA GCC GGT TAC AGC CTC CGG AGC 6087 
Glu Thr Thr Arg Ser Cys Met Cys Thr Ala Gly Tyr Ser Leu Arg. Ser 
1870 1875 1880 
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GGA CAG CAG GCC TGT GAG GGT GTG GGC TCT TTT CTC CTG TAC TCT GTA 
Gly Gin Gin Ala Cys Glu Gly Val Gly Ser Phe Leu Leu Tyr Ser Val 
1885 1890 1895 

CAT GAG GGA ATT CGG GGG ATT CCA CTA GAT CCC AAT GAC AAG TCG GAT 
His Glu Gly lie Arg Gly He Pro Leu Asp Pro Asn Asp Lys Ser Asp 
1900 " * 1905 1910 

GCC CTG GTC CCA GTG TCC GGA ACT TCA CTG GCT GTC GGA ATC GAC TTC 
Ala Leu Val Pro Val Ser Gly Thr Ser Leu Ala Val Gly He Asp Phe 
1915 1920 1925 1930 

CAT GCC GAA AAT GAC ACT ATT TAT TGG GTG GAT ATG GGC CTA AGC ACC 
His Ala Glu Asn Asp Thr He Tyr Trp Val Asp Met Gly Leu Ser Thr 
1935 1940 1945 

ATC AGC AGG GCC AAG CGT GAC CAG ACA TGG CGA GAG GAT GTG GTG ACC 
He Ser Arg Ala Lys Arg Asp Gin Thr Trp Arg Glu Asp Val Val Thr 
1950 1955 1960 

AAC GGT ATT GGC CGT GTG GAG GGC ATC GCC GTG GAC TGG ATC GCA GGC 
Asn Gly He Gly Arg Val Glu Gly He Ala Val Asp Trp He Ala Gly 
1965 1970 1975 

AAC ATA TAC TGG ACG GAC CAG GGC TTC GAT GTC ATC GAG GTT GCC CGG 
Asn He Tyr Trp Thr Asp Gin Gly Phe Asp Val He Glu Val Ala Arg 
1980 1985 1990 

CTC AAT GGC TCT TTT CGT TAT GTG GTC ATT TCC CAG GGT CTG GAC AAG 
Leu Asn Gly Ser Phe Arg Tyr Val Val He Ser Gin Gly Leu Asp Lys 
1995 2000 2005 2010 

CCT CGG GCC ATC ACT GTC CAC CCA GAG AAG GGG TAC TTG TTC TGG ACC 
Pro Arg Ala He Thr Val His Pro Glu Lys Gly Tyr Leu Phe Trp Thr 
2015 2020 2025 

GAG TGG GGT CAT TAC CCA CGT ATT GAG CGG TCT CGC CTT GAT GGC ACA 
Glu Trp Gly His Tyr Pro Arg He Glu Arg Ser Arg Leu Asp Gly Thr 
2030 ' 2035 ' " 2040 

GAG AGA GTG GTG TTG GTT AAT GTC AGC ATC AGC TGG CCC AAT GGC ATC 
Glu Arg Val Val Leu Val Asn Val Ser He Ser Trp Pro Asn Gly He 
2045 2050 2055 

TCA GTA GAC TAT CAG GGC GGC AAG CTC TAC TGG TGT GAT GCT CGG ATG 
Ser Val Asp Tyr Gin Gly Gly Lys Leu Tyr Trp Cys Asp Ala Arg Met 
2060 " " 2065 2070 

GAC AAG ATC GAG CGC ATC GAC CTG GAA ACG GGC GAG AAC CGG GAG GTG 
Asp Lys lie Glu Arg He Asp Leu Glu Thr Gly Glu Asn Arg Glu Val 
2075 2080 2085 2090 

GTC CTG TCC AGC AAT AAC ATG GAT ATG TTC TCC GTG TCC GTG TTT GAG 
Val Leu Ser Ser Asn Asn Met Asp Met Phe Ser Val Ser Val Phe Glu 
2095 2100 2105 

GAC TTC ATC TAC TGG AGT GAC AGA ACT CAC GCC AAT GGC TCC ATC AAG 
Asp Phe He Tyr Trp Ser Asp Arg Thr His Ala Asn Gly Ser He Lys 
2110 2115 2120 
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CGC GGC TGC AAA GAC AAT GCT ACA GAC TCC GTG CCT CTG AGG ACA GGC 
Arg Gly Cys Lys Asp Asn Ala Thr Asp Ser Val Pro Leu Arg Thr Gly 
2125 2130 2135 

ATT GGT GTT CAG CTT AAA GAC ATC AAG GTC TTC AAC AGG GAC AGG CAG 
lie Gly Val Gin Leu Lys Asp lie Lys Val Phe Asn Arg Asp Arg Gin 
2140 2145 2150 

AAG GGT ACC AAT GTG TGC GCG GTA GCC AAC GGC GGG TGC CAG CAG CTC 
Lys Gly Thr Asn Val Cys Ala Val Ala Asn Gly Gly Cys Gin Gin Leu 
2155 2160 2165 2170 

TGC TTG TAT CGG GGT GGC GGA CAG CGA GCC TGT GCC TGT GCC CAC GGG 
Cys Leu Tyr Arg Gly Gly Gly Gin Arg Ala Cys Ala Cys Ala His Gly 
2175 2180 2185 

ATG CTG GCA GAA GAC GGG GCC TCA TGC CGA GAG TAC GCT GGC TAC CTG 
Met Leu Ala Glu Asp Gly Ala Ser Cys Arg Glu Tyr Ala Gly Tyr Leu 
2190 2195 2200 

CTC TAC TCA GAG CGG ACC ATC CTC AAG AGC ATC CAC CTG TCG GAT GAG 
Leu Tyr Ser Glu Arg Thr lie Leu Lys Ser lie His Leu Ser Asp Glu 
2205 2210 2215 

CGT AAC CTC AAC GCA CCG GTG CAG CCC TTT GAA GAC CCC GAG CAC ATG 
Arg Asn Leu Asn Ala Pro Val Gin Pro Phe Glu Asp Pro Glu His Met 
2220 2225 2230 

AAA AAT GTC ATC GCC CTG GCC TTT GAC TAC CGA GCA GGC ACC TCC CCG 
Lys Asn Val lie Ala Leu Ala Phe Asp Tyr Arg Ala Gly Thr Ser Pro 
2235 2240 2245 2250 

GGG ACC CCT AAC CGC ATC TTC TTC AGT GAC ATC CAC TTT GGG AAC ATC 
Gly Thr Pro Asn Arg lie Phe Phe Ser Asp lie His Phe Gly Asn lie 
2255 2260 2265 

CAG CAG ATC AAT GAC GAT GGC TCG GGC AGG ACC ACC ATC GTG GAA AAT 
Gin Gin lie Asn Asp Asp Gly Ser Gly Arg Thr Thr He Val Glu Asn 
2270 2275 2280 

GTG GGC TCT GTG GAA GGC CTG GCC TAT CAC CGT GGC TGG GAC ACA CTG 
Val Gly Ser Val Glu Gly Leu Ala Tyr His Arg Gly Trp Asp Thr Leu 
2285 2290 2295 

TAC TGG ACA AGC TAC ACC ACA TCC ACC ATC ACC CGC CAC ACC GTG GAC 
Tyr Trp Thr Ser Tyr Thr Thr Ser Thr He Thr Arg His Thr Val Asp 
2300 2305 2310 

CAG ACT CGC CCA GGG GCC TTC GAG AGG GAG ACA GTC ATC ACC ATG TCC 
Gin Thr Arg Pro Gly Ala Phe Glu Arg Glu Thr Val He Thr Met Ser 
2315 2320 2325 2330 

GGA GAC GAC CAC CCG AGA GCC TTT GTG CTG GAT GAG TGC CAG AAC CTG 
Gly Asp Asp His Pro Arg Ala Phe Val Leu Asp Glu Cys Qln Asn Leu 
2335 2340 2345 

ATG TTC TGG ACC AAT TGG AAC GAG CTC CAT CCA AGC ATC ATG CGG GCA 
Met Phe Trp Thr Asn Trp Asn Glu Leu His Pro Ser He Met Arg Ala 
2350 2355 2360 
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GCC CTA TCC GGA GCC AAC GTC CTG ACC CTC ATT GAG AAG GAC ATC CGC 
Ala Leu Ser Gly Ala Asn Val Leu Thr Leu lie Glu Lys Asp lie Arg 
2365 2370 2375 

ACG CCC AAT GGG TTG GCC ATC GAC CAC CGG GCG GAG AAG CTG TAC TTC 
Thr Pro Asn Gly Leu Ala lie Asp His Arg Ala Glu Lys Leu Tyr Phe 
2380 2385 2390 

TCG GAT GCC ACC TTG GAC AAG ATC GAG CGC TGC GAG TAC GAC GGC TCC 
Ser Asp Ala Thr Leu Asp Lys lie Glu Arg Cys Glu Tyr Asp Gly Ser 
2395 2400 2405 2410 

CAC CGC TAT GTG ATC CTA AAG TCG GAG CCC GTC CAC CCC TTT GGG TTG 
His Arg Tyr Val He Leu Lys Ser Glu Pro Val His Pro Phe Gly Leu 
2415 2420 2425 

GCG GTG TAC GGA GAG CAC ATT TTC TGG ACT GAC TGG GTG CGG CGG GCT 
Ala Val Tyr Gly Glu His lie Phe Trp Thr Asp Trp Val Arg Arg Ala 
2430 2435 2440 

GTG CAG CGA GCC AAC AAG TAT GTG GGC AGC GAC ATG AAG CTG CTT CGG 
Val Gin Arg Ala Asn Lys Tyr Val Gly Ser Asp Met Lys Leu Leu Arg 
2445 2450 2455 

GTG GAC ATT CCC CAG CAA CCC ATG GGC ATC ATC GCC GTG GCC AAT GAC 
Val Asp He Pro Gin Gin Pro Met Gly He He Ala Val Ala Asn Asp 
2460 2465 2470 

ACC AAC AGC TGT GAA CTC TCC CCC TGC CGT ATC AAC AAT GGA GGC TGC 
Thr Asn Ser Cys Glu Leu Ser Pro Cys Arg He Asn Asn Gly Gly Cys 
2475 2480 2485 2490 

CAG GAT CTG TGT CTG CTC ACC CAC CAA GGC CAC GTC AAC TGT TCC TGT 
Gin Asp Leu Cys Leu Leu Thr His Gin Gly His Val Asn Cys Ser Cys 
2495 2500 2505 

CGA GGG GGC CGG ATC CTC CAG GAG GAC TTC ACC TGC CGG GCT GTG AAC 
Arq Glv Gly Arg He Leu Gin Glu Asp Phe Thr Cys Arg Ala Val Asn 
2510 2515 2520 

TCC TCT TGT CGG GCA CAA GAT GAG TTT GAG TGT GCC AAT GGG GAA TGT 
Ser Ser Cys Arg Ala Gin Asp Glu Phe Glu Cys Ala Asn Gly Glu Cys 
2525 2530 2535 

ATC AGC TTC AGC CTC ACC TGT GAT GGC GTC TCC CAC TGC AAG GAC AAG 
He Ser Phe Ser Leu Thr Cys Asp Gly Val Ser His Cys Lys Asp Lys 
2540 2545 2550 

TCC GAT GAG AAG CCC TCC TAC TGC AAC TCA CGC CGC TGC AAG AAG ACT 
Ser Asp Glu Lys Pro Ser Tyr Cys Asn Ser Arg Arg Cys Lys Lys Thr 
2555 2560 2565 2570 

TTC CGC CAG TGT AAC AAT GGC CGC TGT GTA TCC AAC ATG CTG TGG TGC 
Phe Arq Gin Cys Asn Asn Gly Arg Cys Val Ser Asn Met Leu Trp Cys 
2575 2580 2585 

AAT GGG GTG GAT TAC TGT GGG GAT GGC TCT GAT GAG ATA CCT TGC AAC 
Asn Gly Val Asp Tyr Cys Gly Asp Gly Ser Asp Glu He Pro Cys Asn 
2590 2595 2600 
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AAG ACT GCC TGT GGT GTG GGT GAG TTC CGC TGC CGG GAT GGG TCC TGC 
Lys Thr Ala Cys Gly Val Gly Glu Phe Arg Cys Arg Asp Gly Ser Cys 
2605 " 2610 ' " 2615 

ATC GGG AAC TCC AGT CGC TGC AAC CAG TTT GTG GAT TGT GAG GAT GCC 
He Gly Asn Ser Ser Arg Cys Asn Gin Phe Val Asp Cys Glu Asp Ala 
2620 2625 2630 

TCG GAT GAG ATG AAT TGC AGT GCC ACA GAC TGC AGC AGC TAT TTC CGC 
Ser Asp Glu Met Asn Cys Ser Ala Thr Asp Cys Ser Ser Tyr Phe Arg 
2635 1 2640 2645 2650 

CTG GGC GTG AAA GGT GTC CTC TTC CAG CCG TGC GAG CGG ACA TCC CTG 
Leu Gly Val Lys Gly Val Leu Phe Gin Pro Cys Glu Arg Thr Ser Leu 
2655 2660 2665 

TGC TAC GCA CCT AGC TGG GTG TGT GAT GGC GCC AAC GAC TGT GGA GAC 
Cys Tyr Ala Pro Ser Trp Val Cys Asp Gly Ala Asn Asp Cys Gly Asp 
2670 2675 2680 

TAC AGC GAT GAA CGT GAC TGT CCA GGT GTG AAG CGC CCT AGG TGC CCG 
Tyr Ser Asp Glu Arg Asp Cys Pro Gly Val Lys Arg Pro Arg Cys Pro 
2685 '" 2690 ' 2695 

CTC AAT TAC TTT GCC TGC CCC AGC GGG CGC TGT ATC CCC ATG AGC TGG 
Leu Asn Tyr Phe Ala Cys Pro Ser Gly Arg Cys He Pro Met Ser Trp 
2700 2705 " 2710 

ACG TGT GAC AAG GAG GAT GAC TGT GAG AAC GGC GAG GAT GAG ACC CAC 
Thr Cys Asp Lys Glu Asp Asp Cys Glu Asn Gly Glu Asp Glu Thr His 
2715 " 2720 2725 2730 

TGC AAC AAG TTC TGC TCA GAG GCA CAG TTC GAG TGC CAG AAC CAC CGG 
Cys Asn Lys Phe Cys Ser Glu Ala Gin Phe Glu Cys Gin Asn His Arg 
2735 2740 2745 

TGT ATC TCC AAG CAG TGG CTG TGT GAC GGT AGC GAT GAT TGC GGG GAT 
Cys He Ser Lys Gin Trp Leu Cys Asp Gly Ser Asp Asp Cys Gly Asp 
2750 2755 2760 

GGC TCC GAT GAG GCA GCT CAC TGT GAA GGC AAG ACA TGT GGC CCC TCC 
Gly Ser Asp Glu Ala Ala His Cys Glu Gly Lys Thr Cys Gly Pro Ser 
2765 2770 2775 

TCC TTC TCC TGT CCC GGC ACC CAC GTG TGT GTC CCT GAG CGC TGG CTC 
Ser Phe Ser Cys Pro Gly Thr His Val Cys Val Pro Glu Arg Trp Leu 
2780 ~ 2785 2790 

TGT GAT GGC GAC AAG GAC TGT ACC GAT GGC GCG GAT GAG AGT GTC ACT 
Cys Asp Gly Asp Lys Asp Cys Thr Asp Gly Ala Asp Glu Ser Val Thr 
2795 2800 2805 2810 

GCT GGC TGC CTG TAC AAC AGC ACC TGT GAT GAC CGT GAG TTC ATG TGC 
Ala Gly Cys Leu Tyr Asn Ser Thr Cys Asp Asp Arg Glu Phe Met Cys 
2815 2820 2825 

CAG AAC CGC TTG TGT ATT CCC AAG CAT TTC GTG TGC GAC CAT GAC CGT 
Gin Asn Arg Leu Cys He Pro Lys His Phe Val Cys Asp His Asp Arg 
2830 2835 2840 
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GAC TGT GCT GAT GGC TCT GAT GAA TCC CCT GAG TGT GAG TAC CCA ACC 
Asp Cys Ala Asp Gly Ser Asp Glu Ser Pro Glu Cys Glu Tyr Pro Thr 
2845 2850 2855 

TGC GGG CCC AAT GAA TTC CGC TGT GCC AAT GGG CGT TGT CTG AGC TCC 
Cys Gly Pro Asn Glu Phe Arg Cys Ala Asn Gly Arg Cys Leu Ser Ser 
2860 2865 2870 

CGT CAG TGG GAA TGT GAT GGG GAG AAT GAC TGT CAC GAC CAC AGC GAT 
Arq Gin Trp Glu Cys Asp Gly Glu Asn Asp Cys His Asp His Ser Asp 
2875 2880 2885 2890 

GAG GCT CCC AAG AAC CCA CAC TGC ACC AGC CCA GAG CAC AAA TGC AAT 
Glu Ala Pro Lys Asn Pro His Cys Thr Ser Pro Glu His Lys Cys Asn 
2895 2900 2905 

GCC TCA TCA CAG TTC CTG TGC AGC AGC GGG CGC TGC GTG GCT GAG GCG 
Ala Ser Ser Gin Phe Leu Cys Ser Ser Gly Arg Cys Val Ala Glu Ala 
2910 2915 2920 

TTG CTC TGC AAC GGC CAG GAC GAC TGT GGG GAC GGT TCA GAC GAA CGC 
Leu Leu Cys Asn Gly Gin Asp Asp Cys Gly Asp Gly Ser Asp Glu Arg 
2925 2930 2935 

GGG TGC CAT GTC AAC GAG TGT CTC AGC CGC AAG CTC AGT GGC TGC AGT 
Gly Cys His Val Asn Glu Cys Leu Ser Arg Lys Leu Ser Gly Cys Ser 
2940 2945 2950 

CAG GAC TGC GAG GAC CTC AAG ATA GGC TTT AAG TGC CGC TGT CGC CCG 
Gin Asp Cys Glu Asp Leu Lys lie Gly Phe Lys Cys Arg Cys Arg Pro 
2955 2960 2965 2970 

GGC TTC CGG CTA AAG GAC GAT GGC AGG ACC TGT GCC GAC CTG GAT GAG 
Glv Phe Arg Leu Lys Asp Asp Gly Arg Thr Cys Ala Asp Leu Asp Glu 
2975 2980 2985 

TGC AGC ACC ACC TTC CCC TGC AGC CAG CTC TGC ATC AAC ACC CAC GGA 
Cvs Ser Thr Thr Phe Pro "Cys Ser Gin Leu Cys He Asn Thr His Gly 
2990 2995 3000 

AGT TAC AAG TGT CTG TGT GTG GAG GGC TAT GCA CCC CGT GGC GGT GAC 
Ser Tvr Lys Cys Leu Cys Val Glu Gly Tyr Ala Pro Arg Gly Gly Asp 
3005 3010 3015 

CCC CAC AGC TGC AAA GCT GTG ACC GAT GAG GAG CCA TTT CTC ATC TTT 
Pro His Ser Cys Lys Ala Val Thr Asp Glu Glu Pro Phe Leu He Phe 
3020 3025 3030 

GCC AAC CGG TAC TAC CTG CGG AAG CTC AAC CTG GAC GGC TCC AAC TAC 
Ala Asn Arg Tyr Tyr Leu Arg Lys Leu Asn Leu Asp Gly Ser Asn Tyr 
3035 3040 3045 3050 

ACA CTG CTT AAG CAG GGC CTG AAC AAT GCG GTC GCC TTG GCA TTT GAC 
Thr Leu Leu Lys Gin Gly Leu Asn Asn Ala Val Ala Leu Ala Phe Asp 
3055 3060 3065 

TAC CGA GAG CAG ATG ATC TAC TGG ACG GGC GTG ACC ACC CAG GGC AGC 
Tvr Arq Glu Gin Met He Tyr Trp Thr Gly Val Thr Thr Gin Gly Ser 
y 3070 3075 3080 
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ATG ATT CGC AGG ATG CAC CTC AAC GGC AGC AAC GTG CAG GTT CTG CAC 
Met lie Arg Arg Met His Leu Asn Gly Ser Asn Val Gin Val Leu His 
3085 3090 3095 

CGG ACG GGC CTT AGT AAC CCA GAT GGG CTC GCT GTG GAC TGG GTG GGT 
Arg Thr Gly Leu Ser Asn Pro Asp Gly Leu Ala Val Asp Trp Val Gly 
3100 3105 3110 

GGC AAC CTG TAC TGG TGT GAC AAG GGC AGA GAT ACC ATT GAG GTG TCC 
Gly Asn Leu Tyr Trp Cys Asp Lys Gly Arg Asp Thr lie Glu Val Ser 
3115 3120 3125 3130 

AAG CTT AAC GGG GCC TAT CGG ACA GTG CTG GTC AGC TCT GGC CTC CGG 
Lys Leu Asn Gly Ala Tyr Arg Thr Val Leu Val Ser Ser Gly Leu Arg 
3135 3140 3145 

GAG CCC AGA GCT CTG GTA GTG GAT GTA CAG AAT GGG TAC CTG TAC TGG 
Glu Pro Arg Ala Leu Val Val Asp Val Gin Asn Gly Tyr Leu Tyr Trp 
3150 3155 3160 

ACA GAC TGG GGT GAC CAC TCA CTG ATC GGC CGG ATT GGC ATG GAT GGA 
Thr Asp Trp Gly Asp His Ser Leu lie Gly Arg lie Gly Met Asp Gly 
3165 3170 3175 

TCT GGC CGC AGC ATC ATC GTG GAC ACT AAG ATC ACA TGG CCC AAT GGC 
Ser Gly Arg Ser lie lie Val Asp Thr Lys lie Thr Trp Pro Asn Gly 
3180 3185 3190 

CTG ACC GTG GAC TAC GTC ACG GAA CGC ATC TAC TGG GCT GAC GCC CGT 
Leu Thr Val Asp Tyr Val Thr Glu Arg lie Tyr Trp Ala Asp Ala Arg 
3195 " 3200 3205 3210 

GAG GAC TAC ATC GAG TTC GCC AGC CTG GAT GGC TCC AAC CGT CAC GTT 
Glu Asp Tyr lie Glu Phe Ala Ser Leu Asp Gly Ser Asn Arg His Val 
3215 3220 3225 

GTG CTG AGC CAA GAC ATC CCA CAC ATC TTT GCG CTG ACC CTA TTT GAA 
Val Leu Ser Gin Asp He Pro His He Phe Ala Leu Thr Leu Phe Glu 
3230 3235 3240 

GAC TAC GTC TAC TGG ACA GAC TGG GAA ACG AAG TCC ATC AAC CGG GCC 
Asp Tyr Val Tyr Trp Thr Asp Trp Glu Thr Lys Ser lie Asn Arg Ala 
3245 3250 3255 

CAC AAG ACC ACG GGT GCC AAC AAA ACA CTC CTC ATC AGC ACC CTG CAC 
His Lys Thr Thr Gly Ala Asn Lys Thr Leu Leu He Ser Thr Leu His 
3260 3265 3270 

CGG CCC ATG GAC TTA CAT GTA TTC CAC GCC CTG CGC CAG CCA GAT GTG 
Arg Pro Met Asp Leu His Val Phe His Ala Leu Arg Gin Pro Asp Val 
3275 3280 3285 3290 

CCC AAT CAC CCC TGC AAA GTC AAC AAT GGT GGC TGC AGC AAC CTG TGC 
Pro Asn His Pro Cys Lys Val Asn Asn Gly Gly Cys Ser Asn Leu Cys 
3295 3300 ' 3305 

CTG CTG TCC CCT GGG GGT GGT CAC AAG TGC GCC TGC CCC ACC AAC TTC 
Leu Leu Ser Pro Gly Gly Gly His Lys Cys Ala Cys Pro Thr Asn Phe 
3310 3315 3320 
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TAT CTG GGT GGC GAT GGC CGT ACC TGT GTG TCC AAC TGC ACA GCA AGC 
Tyr Leu Gly Gly Asp Gly Arg Thr Cys Val Ser Asn Cys Thr Ala Ser 
3325 3330 3335 

CAG TTT GTG TGC AAA AAT GAC AAG TGC ATC CCC TTC TGG TGG AAG TGT 
Gin Phe Val Cys Lys Asn Asp Lys Cys lie Pro Phe Trp Trp Lys Cys 
3340 3345 3350 

GAC ACG GAG GAC GAC TGT GGG GAT CAC TCA GAC GAG CCT CCA GAC TGT 
Asp Thr Glu Asp Asp Cys Gly Asp His Ser Asp Glu Pro Pro Asp Cys 
3355 3360 3365 3370 

CCC GAG TTC AAG TGC CGC CCA GGC CAG TTC CAG TGC TCC ACC GGC ATC 
Pro Glu Phe Lys Cys Arg Pro Gly Gin Phe Gin Cys Ser Thr Gly lie 
3375 3380 3385 

TGC ACC AAC CCT GCC TTC ATC TGT GAT GGG GAC AAT GAC TGC CAA GAC 
Cys Thr Asn Pro Ala Phe lie Cys Asp Gly Asp Asn Asp Cys Gin Asp 
3390 3395 3400 

AAT AGT GAC GAG GCC AAT TGC GAC ATT CAC GTC TGC TTG CCC AGC CAA 
Asn Ser Asp Glu Ala Asn Cys Asp lie His Val Cys Leu Pro Ser Gin 
3405 3410 3415 

TTC AAG TGC ACC AAC ACC AAC CGC TGC ATT CCT GGC ATC TTC CGT TGC 
Phe Lys Cys Thr Asn Thr Asn Arg Cys lie Pro Gly He Phe Arg Cys 
3420 3425 3430 

AAT GGG CAG GAC AAC TGC GGG GAC GGC GAG GAT GAG CGG GAT TGC CCT 
Asn Gly Gin Asp Asn Cys Gly Asp Gly Glu Asp Glu Arg Asp Cys Pro 
3435 3440 3445 3450 

GAG GTG ACC TGC GCC CCC AAC CAG TTC CAG TGC TCC ATC ACC AAG CGC 
Glu Val Thr Cys Ala Pro Asn Gin Phe Gin Cys Ser He Thr Lys Arg 
3455 3460 3465 

TGC ATC CCT CGC GTC TGG GTC TGT GAC AGG GAT AAT CAC TGT GTG GAC 
Cys He Pro Arg Val Trp Val Cys Asp Arg Asp Asn His Cys Val Asp 
3470 3475 3480 

GGC AGT GAT GAG CCT GCC AAC TGT ACC CAA ATG ACC TGT GGA GTG GAT 
Gly Ser Asp Glu Pro Ala Asn Cys Thr Gin Met Thr Cys Gly Val Asp 
3485 3490 3495 

GAG TTC CGC TGC AAG GAT TCT GGC CGC TGC ATC CCC GCG CGC TGG AAG 
Glu Phe Arg Cys Lys Asp Ser Gly Arg Cys He Pro Ala Arg Trp Lys 
3500 3505 3510 

TGT GAC GGA GAA GAT GAC TGT GGG GAT GGT TCA GAT GAG CCC AAG GAA 
Cys Asp Gly Glu Asp Asp Cys Gly Asp Gly Ser Asp Glu Pro Lys Glu 
3515 3520 3525 3530 

GAG TGT GAT GAG CGC ACC TGT GAG CCA TAC CAG TTC CGC TGC AAA AAC 
Glu Cys Asp Glu Arg Thr Cys Glu Pro Tyr Gin Phe Arg Cys Lys Asn 
3535 3540 3545 

AAC CGC TGT GTC CCA GGC CGT TGG CAA TGT GAC TAC GAC AAC GAC TGC 
Asn Arg Cys Val Pro Gly Arg Trp Gin Cys Asp Tyr Asp Asn Asp Cys 
3550 3555 3560 



FIG. 6a 



8449 -128 (SHEET OF fr{ ) 



GGA GAT AAC TCG GAC GAG GAG AGC TGC ACA CCT CGG CCC TGC TCT GAG 
Gly Asp Asn Ser Asp Glu Glu Ser Cys Thr Pro Arg Pro Cys Ser Glu 
3565 3570 3575 

AGT GAG TTT TTC TGT GCC AAT GGC CGC TGC ATC GCT GGG CGC TGG AAG 
Ser Glu Phe Phe Cys Ala Asn Gly Arg Cys lie Ala Gly Arg Trp Lys 
3580 3585 3590 

TGT GAT GGG GAC CAT GAC TGT GCC GAC GGC TCA GAC GAG AAA GAC TGC 
Cys Asp Gly Asp His Asp Cys Ala Asp Gly Ser Asp Glu Lys Asp Cys 
3595 3600 3605 3610 

ACC CCC CGC TGT GAT ATG GAC CAG TTC CAG TGC AAG AGT GGC CAC TGC 
Thr Pro Arg Cys Asp Met Asp Gin Phe Gin Cys Lys Ser Gly His Cys 
3615 3620 3625 

ATC CCC CTG CGC TGG CCG TGT GAC GCG GAT GCT GAC TGT ATG GAC GGC 
lie Pro Leu Arg Trp Pro Cys Asp Ala Asp Ala Asp Cys Met Asp Gly 
3630 3635 3640 

AGT GAC GAG GAA GCC TGT GGC ACT GGG GTG AGG ACC TGC CCA TTG GAT 
Ser Asp Glu Glu Ala Cys Gly Thr Gly Val Arg Thr Cys Pro Leu Asp 
3645 3650 3655 

GAG TTT CAA TGT AAC AAC ACC TTG TGC AAG CCG CTG GCC TGG AAG TGT 
Glu Phe Gin Cys Asn Asn Thr Leu Cys Lys Pro Leu Ala Trp Lys Cys 
3660 3665 3670 

GAT GGA GAG GAC GAC TGT GGG GAC AAC TCA GAT GAG AAC CCC GAG GAA 
Asp Gly Glu Asp Asp Cys Gly Asp Asn Ser Asp Glu Asn Pro Glu Glu 
3675 3680 3685 3690 

TGC GCC CGG TTC ATC TGC CCT CCC AAC CGG CCT TTC CGC TGC AAG AAT 
Cys Ala Arg Phe lie Cys Pro Pro Asn Arg Pro Phe Arg Cys Lys Asn 
3695 3700 3705 

GAC CGA GTC TGC CTG TGG ATT GGG CGC CAG TGT GAT GGC GTG GAC AAC 
Asp Arg Val Cys Leu Trp lie Gly Arg Gin Cys Asp Gly Val Asp Asn 
3710 3715 3720 

TGT GGA GAT GGG ACT GAC GAG GAG GAC TGT GAG CCC CCC ACG GCC CAG 
Cys Gly Asp Gly Thr Asp Glu Glu Asp Cys Glu Pro Pro Thr Ala Gin 
3725 3730 3735 

AAC CCC CAC TGC AAA GAC AAG AAG GAG TTC CTG TGC CGA AAC CAG CGC 
Asn Pro His Cys Lys Asp Lys Lys Glu Phe Leu Cys Arg Asn Gin Arg 
3740 3745 3750 

TGT CTA TCA TCC TCC CTG CGC TGT AAC ATG TTC GAT GAC TGC GGC GAT 
Cys Leu Ser Ser Ser Leu Arg Cys Asn Met Phe Asp Asp Cys Gly Asp 
3755 3760 3765 3770 

GGC TCC GAT GAA GAA GAT TGC AGC ATC GAC CCC AAG CTG ACC AGC TGT 
Gly Ser Asp Glu Glu Asp Cys Ser lie Asp Pro Lys Leu Thr Ser Cys 
3775 3780 3785 

GCC ACC AAT GCC AGC ATG TGT GGG GAC GAA GCT CGT TGT GTG CGC ACT 
Ala Thr Asn Ala Ser Met Cys Gly Asp Glu Ala Arg Cys Val Arg Thr 
3790 3795 3800 



FIG. 6a 



8449-128 (SHEETS? OF 



GAG AAA GCT GCC TAC TGT GCC TGC CGC TCG GGC TTC CAT ACT GTG CCG 
Glu Lys Ala Ala Tyr Cys Ala Cys Arg Ser Gly Phe His Thr Val Pro 
3805 3810 3815 

GGC CAG CCC GGA TGC CAG GAC ATC AAC GAG TGC CTG CGC TTT GGT ACC 
Gly Gin Pro Gly Cys Gin Asp lie Asn Glu Cys Leu Arg Phe Gly Thr 
3820 3825 3830 

TGC TCT CAG CTC TGG AAC AAA CCC AAG GGA GGC CAC CTC TGC AGC TGT 
Cys Ser Gin Leu Trp Asn Lys Pro Lys Gly Gly His Leu Cys Ser Cys 
3835 3840 3845 3850 

GCC CGC AAC TTC ATG AAG ACA CAC AAC ACC TGC AAA GCT GAA GGC TCC 
Ala Arg Asn Phe Met Lys Thr His Asn Thr Cys Lys Ala Glu Gly Ser 
3855 3860 3865 

GAG TAC CAG GTG CTA TAC ATC GCG GAT GAC AAC GAG ATC CGC AGC TTG 
Glu Tyr Gin Val Leu Tyr lie Ala Asp Asp Asn Glu lie Arg Ser Leu 
3870 3875 3880 

TTC CCG GGC CAC CCC CAC TCA GCC TAC GAG CAG ACA TTC CAG GGC GAT 
Phe Pro Gly His Pro His Ser Ala Tyr Glu Gin Thr Phe Gin Gly Asp 
3885 3890 3895 

GAG AGT GTC CGC ATA GAT GCC ATG GAT GTC CAT GTC AAG GCC GGC CGT 
Glu Ser Val Arg lie Asp Ala Met Asp Val His Val Lys Ala Gly Arg 
3900 3905 3910 

GTC TAC TGG ACT AAC TGG CAC ACG GGC ACA ATC TCC TAC AGG AGC CTG 
Val Tyr Trp Thr Asn Trp His Thr Gly Thr lie Ser Tyr Arg Ser Leu 
3915 3920 3925 3930 

CCC CCT GCC GCC CCT CCT ACC ACT TCC AAC CGC CAC CGG AGG CAG ATC 
Pro Pro Ala Ala Pro Pro Thr Thr Ser Asn Arg His Arg Arg Gin lie 
3935 3940 3945 

GAC CGG GGT GTC ACC CAC CTC AAT ATT TCA GGG CTG AAG ATG CCG AGG 
Asp Arg Gly Val Thr His Leu Asn He Ser Gly Leu Lys Met Pro Arg 
3950 3955 3960 

GGT ATC GCT ATC GAC TGG GTG GCC GGG AAT GTG TAC TGG ACC GAT TCC 
Gly He Ala He Asp Trp Val Ala Gly Asn Val Tyr Trp Thr Asp Ser 
3965 3970 3975 

GGC CGA GAC GTG ATT GAG GTG GCG CAA ATG AAG GGC GAG AAC CGC AAG 
Gly Arg Asp Val He Glu Val Ala Gin Met Lys Gly Glu Asn Arg Lys 
3980 3985 3990 

ACG CTC ATC TCG GGC ATG ATT GAT GAG CCC CAT GCC ATC GTG GTG GAC 
Thr Leu He Ser Gly Met He Asp Glu Pro His Ala He Val Val Asp 
3995 4000 4005 4010 

CCT CTG AGG GGC ACC ATG TAC TGG TCA GAC TGG GGG AAC CAC CCC AAG 
Pro Leu Arg Gly Thr Met Tyr Trp Ser Asp Trp Gly Asn His Pro Lys 
4015 4020 4025 

ATT GAA ACA GCA GCG ATG GAT GGC ACC CTT CGG GAG ACT CTC GTG CAA 
He Glu Thr Ala Ala Met Asp Gly Thr Leu Arg Glu Thr Leu Val Gin 
4030 4035 4040 
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GAC AAC ATT CAG TGG CCT ACA GGG CTG GCT GTG GAC TAT CAC AAT GAA 12 615 
Asp Asn lie Gin Trp Pro Thr Gly Leu Ala Val Asp Tyr His Asn Glu 
4045 4050 4055 

CGG CTC TAC TGG GCA GAT GCC AAG CTT TCG GTC ATC GGC AGC ATC CGG 12 663 
Arg Leu Tyr Trp Ala Asp Ala Lys Leu Ser Val He Gly Ser He Arg 
4060 4065 4070 

CTC AAC GGC ACT GAC CCC ATT GTG GCT GCT GAC AGC AAA CGA GGC CTA 12711 
Leu Asn Gly Thr Asp Pro He Val Ala Ala Asp Ser Lys Arg Gly Leu 
4075 4080 4085 4090 

AGT CAC CCC TTC AGC ATC GAT GTG TTT GAA GAC TAC ATC TAC GGA GTC 12759 
Ser His Pro Phe Ser He Asp Val Phe Glu Asp Tyr He Tyr Gly Val 
4095 4100 4105 

ACT TAC ATC AAT AAT CGT GTC TTC AAG ATC CAC AAG TTT GGA CAC AGC 12807 
Thr Tyr He Asn Asn Arg Val Phe Lys He His Lys Phe Gly His Ser 
4110 4115 4120 

CCC TTG TAC AAC CTA ACT GGG GGC CTG AGC CAT GCC TCT GAT GTA GTC 12855 
Pro Leu Tyr Asn Leu Thr Gly Gly Leu Ser His Ala Ser Asp Val Val 
4125 4130 4135 

CTT TAC CAT CAA CAC AAG CAG CCT GAA GTG ACC AAC CCC TGT GAC CGC 12903 
Leu Tyr His Gin His Lys Gin Pro Glu Val Thr Asn Pro Cys Asp Arg 
4140 4145 4150 

AAG AAA TGC GAA TGG CTG TGT CTG CTG AGC CCC AGC GGG CCT GTC TGC 12 951 
Lys Lys Cys Glu Trp Leu Cys Leu Leu Ser Pro Ser Gly Pro Val Cys 
4155 4160 4165 4170 

ACC TGT CCC AAT GGA AAG AGG CTG GAT AAT GGC ACC TGT GTG CCT GTG 12 999 
Thr Cys Pro Asn Gly Lys Arg Leu Asp Asn Gly Thr Cys Val Pro Val 
4175 4180 4185 

CCC TCT CCA ACA CCC CCT CCA GAT GCC CCT AGG CCT GGA ACC TGC ACT 13047 . 
Pro Ser Pro Thr Pro Pro Pro Asp Ala Pro Arg Pro Gly Thr Cys Thr 
4190 4195 4200 

CTG CAG TGC TTC AAT GGT GGT AGT TGT TTC CTC AAC GCT CGG AGG CAG 13095 
Leu Gin Cys Phe Asn Gly Gly Ser Cys Phe Leu Asn Ala Arg Arg Gin 
4205 4210 4215 

CCC AAG TGC CGT TGC CAG CCC CGT TAC ACA GGC GAT AAG TGT GAG CTG 13143 
Pro Lys Cys Arg Cys Gin Pro Arg Tyr Thr Gly Asp Lys Cys Glu Leu 
4220 4225 4230 

GAT CAG TGC TGG GAA TAC TGT CAC AAC GGA GGC ACC TGT GCG GCT TCC 13191 
Asp Gin Cys Trp Glu Tyr Cys His Asn Gly Gly Thr Cys Ala Ala Ser 
4235 4240 4245 4250 

CCA TCT GGC ATG CCC ACG TGC CGC TGT CCC ACT GGC TTC ACG GGC CCC 13239 
Pro Ser Gly Met Pro Thr Cys Arg Cys Pro Thr Gly Phe Thr Gly Pro 
4255 4260 ' 4265 

AAA TGC ACC GCA CAG GTG TGT GCA GGC TAC TGC TCT AAC AAC AGC ACC 13287 
Lys Cys Thr Ala Gin Val Cys Ala Gly Tyr Cys Ser Asn Asn Ser Thr 
4270 4275 4280 
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TGC ACC GTC AAC CAG GGC AAC CAG CCC CAG TGC CGA TGT CTA CCT GGC 
Cys Thr Val Asn Gin Gly Asn Gin Pro Gin Cys Arg Cys Leu Pro Gly 
4285 4290 4295 

TTC CTG GGC GAC CGT TGC CAG TAC CGG CAG TGC TCT GGC TTC TGT GAG 
Phe Leu Gly Asp Arg Cys Gin Tyr Arg Gin Cys Ser Gly Phe Cys Glu 
4300 4305 4310 

AAC TTT GGC ACC TGT CAG ATG GCT GCT GAT GGC TCC CGA CAA TGT CGC 
Asn Phe Gly Thr Cys Gin Met Ala Ala Asp Gly Ser Arg Gin Cys Arg 
4315 4320 4325 4330 

TGC ACC GTC TAC TTT GAG GGA CCA AGG TGT GAG GTG AAC AAG TGT AGT 
Cys Thr Val Tyr Phe Glu Gly Pro Arg Cys Glu Val Asn Lys Cys Ser 
4335 4340 4345 

CGC TGT CTC CAA GGC GCC TGT GTG GTC AAT AAG CAG ACC GGA GAT GTC 
Arg Cys Leu Gin Gly Ala Cys Val Val Asn Lys Gin Thr Gly Asp Val 
4350 4355 4360 

ACA TGC AAC TGC ACT GAT GGC CGG GTA GCC CCC AGT TGT CTC ACC TGC 
Thr Cys Asn Cys Thr Asp Gly Arg Val Ala Pro Ser Cys Leu Thr Cys 
4365 4370 4375 

ATC GAT CAC TGT AGC AAT GGT GGC TCC TGC ACC ATG AAC AGC AAG ATG 
lie Asp His Cys Ser Asn Gly Gly Ser Cys Thr Met Asn Ser Lys Met 
4380 4385 4390 

ATG CCT GAG TGC CAG TGC CCG CCC CAT ATG ACA GGA CCC CGG TGC CAG 
Met Pro Glu Cys Gin Cys Pro Pro His Met Thr Gly Pro Arg Cys Gin 
4395 4400 4405 4410 

GAG CAG GTT GTT AGT CAG CAA CAG CCT GGG CAT ATG GCC TCC ATC CTG 
Glu Gin Val Val Ser Gin Gin Gin Pro Gly His Met Ala Ser lie Leu 
4415 4420 4425 

ATC CCT CTG CTG CTG CTT CTC CTG CTG CTT CTG GTG GCT' GGC GTG GTG 
lie Pro Leu Leu Leu Leu Leu Leu Leu Leu Leu Val Ala Gly Val Val 
4430 4435 4440 

TTC TGG TAT AAG CGG CGA GTC CGA GGG GCT AAG GGC TTC CAG CAC CAG 
Phe Trp Tyr Lys Arg Arg Val Arg Gly Ala Lys Gly Phe Gin His Gin 
4445 4450 4455 

CGG ATG ACC AAT GGG GCC ATG AAT GTG GAA ATT GGA AAC CCT ACC TAC 
Arg Met Thr Asn Gly Ala Met Asn Val Glu lie Gly Asn Pro Thr Tyr 
4460 4465 4470 

AAG ATG TAT GAA GGT GGA GAG CCC GAT GAT GTC GGG GGC CTA CTG GAT 
Lys Met Tyr Glu Gly Gly Glu Pro Asp Asp Val Gly Gly Leu Leu Asp 
4475' 4480 4485 4490 

GCT GAT TTT GCC CTT GAC CCT GAC AAG CCT ACC AAC TTC ACC AAC CCA 
Ala Asp Phe Ala Leu Asp Pro Asp Lys Pro Thr Asn Phe Thr Asn Pro 
4495 4500 4505 

GTG TAT GCC ACG CTC TAC ATG GGG GGC CAC GGC AGC CGC CAT TCC CTG 
Val Tyr Ala Thr Leu Tyr Met Gly Gly His Gly Ser Arg His Ser Leu 
4510 4515 4520 
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GCC AGC ACG GAC GAG AAG CGA GAA CTG CTG GGC CGG GGA CCT GAA GAC 14 055 
Ala Ser Thr Asp Glu Lys Arg Glu Leu Leu Gly Arg Gly Pro Glu Asp 
4525 4530 4535 

GAG ATA GGA GAT CCC TTG GCA TAGGGCCCTG CCCCGACGGA TGTCCCCAGA AAGC 14110 

CCCCTGCCAC ATGAGTCTTT CAATGAACCC CCTCCCCAGC CGGCCCTTCT CCGGCCCTGC 14170 
Glu lie Gly Asp Pro Leu Ala 
4540 4545 

CGGGTGTACA AATGTAAAAA TGAAGGAATT ACTTTTTATA TGTGAGCGAG CAAGCGAGCA 14 230 

AGCACAGTAT TATCTCTTTG CATTTCCTTC CTGCCTGCTC CTCAGTATCC CCCCCATGCT 14 290 

GCCTTGAGGG GGCGGGGAGG GCTTTGTGGC TCAAAGGTAT GAAGGAGTCC ACATGTTCCC 14 350 

TACCGAGCAT ACCCCTGGAA GCCTGGCGGC ACGGCCTCCC CACCACGCCT GT GC AAG AC A 14410 

CTCAACGGGG CTCCGTGTCC CAGCTTTCCT TTCCTTGGCT CTCTGGGGTT AGTTCAGGGG 14 470 

AGGTGGAGTC CTCTGCTGAC CCTGTCTGGA AGATTTGGCT CTAGCTGAGG AAGGAGTCTT 14530 

TTAGTTGAGG GAAGTCACCC CAAACCCCAG CTCCCACTTT CAGGGGCACC TCTCAGATGG 14590 

CCATGCTCAG TATCCCTTCC AG ACAGG CCC TCCCCTCTCT AGCGCCCCCT CTGTGGCTCC 14 650 

TAGGGCTGAA CACATTCTTT GGTAACTGTC CCCCAAGCCT CCCATCCCCC TGAGGGCCAG 14710 

GAAGAGTCGG GGCACACCAA GGAAGGGCAA GCGGGCAGCC CCATTTTGGG GACGTGAACG 14770 

TTTTAATAAT TTTTGCTGAA TTCCTTTACA ACTAAATAAC ACAGATATTG TTATAAATAA 14830 
AATTGTAAAA AAAAAAAAA 
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He Tyr His Gin Arg Arg Gin Pro Arg Val Arg Ser His Ala Cys Glu 
465 470 475 480 

Asn Asp Gin Tyr Gly Lys Pro Gly Gly Cys Ser Asp lie Cys Leu Leu 

485 490 495 

Ala Asn Ser His Lys Ala Arg Thr Cys Arg Cys Arg Ser Gly Phe Ser 

500 505 510 

Leu Gly Ser Asp Gly Lys Ser Cys Lys Lys Pro Glu His Glu Leu Phe 

515 520 525 

Leu Val Tyr Gly Lys Gly Arg Pro Gly He He Arg Gly Met Asp Met 

530 " 535 540 

Gly Ala Lys Val Pro Asp Glu His Met He Pro He Glu Asn Leu Met 
545 550 555 560 

Asn Pro Arg Ala Leu Asp Phe His Ala Glu Thr Gly Phe He Tyr Phe 

565 570 575 

Ala Asp Thr Thr Ser Tyr Leu He Gly Arg Gin Lys He Asp Gly Thr 

580 585 590 

Glu Arg Glu Thr He Leu Lys Asp Gly He His Asn Val Glu Gly Val 

595 600 605 

Ala Val Asp Trp Met Gly Asp Asn Leu Tyr Trp Thr Asp Asp Gly Pro 

610 615 620 

Lys Lys Thr He Ser Val Ala Arg Leu Glu Lys Ala Ala Gin Thr Arg 
625 630 635 640 

Lys Thr Leu He Glu Gly Lys Met Thr His Pro Arg Ala He Val Val 

645 650 655 

Asp Pro Leu Asn Gly Trp Met Tyr Trp Thr Asp Trp Glu Glu Asp Pro 

660 665 670 

Lys Asp Ser Arg Arg Gly Arg Leu Glu Arg Ala Trp Met Asp Gly Ser 

675 680 685 

His Arg Asp He Phe Val Thr Ser Lys Thr Val Leu Trp Pro Asn Gly 

690 695 700 

Leu Ser Leu Asp He Pro Ala Gly Arg Leu Tyr Trp Val Asp Ala Phe 
705 710 715 720 

Tyr Asp Arg He Glu Thr He Leu Leu Asn Gly Thr Asp Arg Lys He 

725 730 735 

Val Tyr Glu Gly Pro Glu Leu Asn His Ala Phe Gly Leu Cys His His 

740 745 750 

Gly Asn Tyr Leu Phe Trp Thr Glu Tyr Arg Ser Gly Ser Val Tyr Arg 

755 760 765 

Leu Glu Arg Gly Val Ala Gly Ala Pro Pro Thr Val Thr Leu Leu Arg 

770 775 780 

Ser Glu Arg Pro Pro lie Phe Glu He Arg Met Tyr Asp Ala His Glu 
785 790 795 800 

Gin Gin Val Gly Thr Asn Lys Cys Arg Val Asn Asn Gly Gly Cys Ser 

805 810 815 

Ser Leu Cys Leu Ala Thr Pro Gly Ser Arg Gin Cys Ala Cys Ala Glu 

820 825 830 

Asp Gin Val Leu Asp Thr Asp Gly Val Thr Cys Leu Ala Asn Pro Ser 

835 840 845 

Tyr Val Pro Pro Pro Gin Cys Gin Pro Gly Gin Phe Ala Cys Ala Asn 

850 855 860 

Asn Arg Cys He Gin Glu Arg Trp Lys Cys Asp Gly Asp Asn Asp Cys 
865 - 870 875 880 

Leu Asp Asn Ser Asp Glu Ala Pro Ala Leu Cys His Gin His Thr Cys 

885 890 . 895 

Pro Ser Asp Arg Phe Lys Cys Glu Asn Asn Arg Cys He Pro Asn Arg 

900 905 910 

Trp Leu Cys Asp Gly Asp Asn Asp Cys Gly Asn Ser Glu Asp Glu Ser 
915 920 925 
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Asn Ala Thr Cys Ser Ala Arg Thr Cys Pro Pro Asn Gin Phe Ser Cys 

930 935 940 

Ala Ser Gly Arg Cys He Pro He Ser Trp Thr Cys Asp Leu Asp Asp 
945 950 955 960 

Asp Cys Gly Asp Arg Ser Asp Glu Ser Ala Ser Cys Ala Tyr Pro Thr 

965 970 975 

Cys Phe Pro Leu Thr Gin Phe Thr Cys Asn Asn Gly Arg Cys He Asn 

980 985 990 

He Asn Trp Arg Cys Asp Asn Asp Asn Asp Cys Gly Asp Asn Ser Asp 

995 1000 1005 

Glu Ala Gly Cys Ser His Ser Cys Ser Ser Thr Gin Phe Lys Cys Asn 

1010 1015 1020 

Ser Gly Arg Cys He Pro Glu His Trp Thr Cys Asp Gly Asp Asn Asp 
025 1030 1035 1040 

Cys Gly Asp Tyr Ser Asp Glu Thr His Ala Asn Cys Thr Asn Gin Ala 

1045 1050 1055 

Thr Arg Pro Pro Gly Gly Cys His Ser Asp Glu- Phe Gin Cys Pro Leu 

1060 1065 1070 

Asp Gly Leu Cys He Pro Leu Arg Trp Arg Cys Asp Gly Asp Thr Asp 

1075 1080 1085 

Cys Met Asp Ser Ser Asp Glu Lys Ser Cys Glu Gly Val Thr His Val 

1090 1095 HOO 

Cys Asp Pro Asn Val Lys Phe Gly Cys Lys Asp Ser Ala Arg Cys He 
105 1110 1115 1120 

Ser Lys Ala Trp Val Cys Asp Gly Asp Ser Asp Cys Glu Asp Asn Ser 

1125 1130 1135 

Asp Glu Glu Asn Cys Glu Ala Leu Ala Cys Arg Pro Pro Ser His Pro 

11^0 1145 1150 

Cys Ala Asn Asn Thr Ser Val Cys Leu Pro Pro Asp Lys Leu Cys Asp 

1155 1160 1165 

Gly Lys Asp Asp Cys Gly Asp Gly Ser Asp Glu Gly Glu Leu Cys Asp 

1170 1175 H80 

Gin Cys Ser Leu Asn Asn Gly Gly Cys Ser His Asn Cys Ser Val Ala 
185 1190 1195 1200 

Pro Gly Glu Gly He Val Cys Ser Cys Pro Leu Gly Met Glu Leu Gly 

1205 1210 1215 

Ser Asp Asn His Thr Cys Gin He Gin Ser Tyr Cys Ala Lys His Leu 

1220 1225 1230 

Lys Cys Ser Gin Lys Cys Asp Gin Asn Lys Phe Ser Val Lys Cys Ser 

1235 1240 1245 

Cys Tyr Glu Gly Trp Val Leu Glu Pro Asp Gly Glu Thr Cys Arg Ser 

1250 1255 1260 

Leu Asp Pro Phe Lys Leu Phe He He Phe Ser Asn Arg His Glu He 
265 1270 1275 1280 

Arg Arg lie Asp Leu His Lys Gly Asp Tyr Ser Val Leu Val Pro Gly 

1285 1290 1295 

Leu Arg Asn Thr He Ala Leu Asp Phe His Leu Ser Gin Ser Ala Leu 

1300 1305 . 1310 

Tyr Trp Thr Asp Ala Val Glu Asp Lys He Tyr Arg Gly Lys Leu Leu 

1315 1320 1325 

Asp Asn Gly Ala Leu Thr Ser Phe Glu Val Val He Gin Tyr Gly Leu 

1330 1335 1340 

Ala Thr Pro Glu Gly Leu Ala Val Asp Trp He Ala Gly Asn He Tyr 
345 1350 1355 1360 

Trp Val Glu Ser Asn Leu Asp Gin He Glu Val Ala Lys Leu Asp Gly 

1365 1370 1375 

Thr Leu Arg Thr Thr Leu Leu Ala Gly Asp He Glu His Pro Arg Ala 

1380 1385 1390 

He Ala Leu Asp Pro Arg Asp Gly He Leu Phe Trp Thr Asp Trp Asp 
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1395 1400 ' 1405 

Ala Ser Leu Pro Arg lie Glu Ala Ala Ser Met Ser Gly Ala Gly Arg 

1410 1415 1420 

Arg Thr lie His Arg Glu Thr Gly Ser Gly Gly Cys Ala Asn Gly Leu 
425 1430 1435 1440 

Thr Val Asp Tyr Leu Glu Lys Arg lie Leu Trp lie Asp Ala Arg Ser 

1445 1450 1455 

Asp Ala lie Tyr Ser Ala Arg Tyr Asp Gly Ser Gly His Met Glu Val 

1460 1465 1470 

Leu Arg Gly His Glu Phe Leu Ser His Pro Phe Ala Val Thr Leu Tyr 

1475 1480 1485 

Gly Gly Glu Val Tyr Trp Thr Asp Trp Arg Thr Asn Thr Leu Ala Lys 

1490 1495 1500 

Ala Asn Lys Trp Thr Gly His Asn Val Thr Val Val Gin Arg Thr Asn 
505 1510 1515 1520 

Thr Gin Pro Phe Asp Leu Gin Val Tyr His Pro Ser Arg Gin Pro Met 

1525 1530 1535 

Ala Pro Asn Pro Cys Glu Ala Asn Gly Gly Arg Gly Pro Cys Ser His 

1540 1545 1550 

Leu Cys Leu lie Asn Tyr Asn Arg Thr Val Ser Trp Ala Cys Pro His 

1555 1560 1565 

Leu Met Lys Leu His Lys Asp Asn Thr Thr Cys Tyr Glu Phe Lys Lys 

1570 1575 1580 

Phe Leu Leu Tyr Ala Arg Gin Met Glu lie Arg Gly Val Asp Leu Asp 
585 1590 1595 1600 

Ala Pro Tyr Tyr Asn Tyr lie lie Ser Phe Thr Val Pro Asp He Asp 

1605 1610 1615 

Asn Val Thr Val Leu Asp Tyr Asp Ala Arg Glu Gin Arg Val Tyr Trp 

1620 1625 1630 

Ser Asp Val Arg Thr Gin Ala He Lys Arg Ala Phe He Asn Gly Thr 

1635 1640 1645 

Gly Val Glu Thr Val Val Ser Ala Asp Leu Pro Asn Ala His Gly Leu 

1650 1655 1660 

Ala Val Asp Trp Val Ser Arg Asn Leu Phe Trp Thr Ser Tyr Asp Thr 
665 1670 1675 1680 

Asn Lys Lys Gin He Asn Val Ala Arg Leu Asp Gly Ser Phe Lys Asn 

1685 1690 1695 

Ala Val Val Gin Gly Leu Glu Gin Pro His Gly Leu Val Val His Pro 

1700 1705 1710 

Leu Arg Gly Lys Leu Tyr Trp Thr Asp Gly Asp Asn He Ser Met Ala 

1715 1720 1725 

Asn Met Asp Gly Ser Asn His Thr Leu Leu Phe Ser Gly Gin Lys Gly 

1730 1735 1740 

Pro Val Gly Leu Ala He Asp Phe Pro Glu Ser Lys Leu Tyr Trp He 
745 1750 1755 1760 

Ser Ser Gly Asn His Thr He Asn Arg Cys Asn Leu Asp Gly Ser Glu 

1765 1770 1775 

Leu Glu Val He Asp Thr Met Arg Ser Gin Leu Gly Lys Ala Thr Ala 

1780 1785 1790 

Leu Ala He Met Gly Asp Lys Leu Trp Trp Ala Asp Gin Val Ser Glu 

1795 1800 1805 

Lys Met Gly Thr Cys Asn Lys Ala Asp Gly Ser Gly Ser Val Val Leu 

1810 1815 1820 

Arg Asn Ser Thr Thr Leu Val Met His Met Lys Val Tyr Asp Glu Ser 
825 1830 1835 1840 

lie Gin Leu Glu His Glu Gly Thr Asn Pro Cys Ser Val Asn Asn Gly 

1845 1850 1855 

Asp Cys Ser Gin Leu Cys Leu Pro Thr Ser Glu Thr Thr Arg Ser Cys 
1860 1865 1870 
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Met Cys Thr Ala Gly Tyr Ser Leu Arg Ser Gly Gin Gin Ala Cys Glu 

1875 1880 1885 

Gly Val Gly Ser Phe Leu Leu Tyr Ser Val His Glu Gly lie Arg Gly 

1890 1895 1900 

lie Pro Leu Asp Pro Asn Asp Lys Ser Asp Ala Leu Val Pro Val Ser 
905 1910 1915 1920 

Gly Thr Ser Leu Ala Val Gly lie Asp Phe His Ala Glu Asn Asp Thr 

1925 1930 1935 

lie Tyr Trp Val Asp Met Gly Leu Ser Thr lie Ser Arg Ala Lys Arg 

1940 1945 1950 

Asp Gin Thr Trp Arg Glu Asp Val Val Thr Asn Gly lie Gly Arg Val 

1955 1960 1965 

Glu Gly lie Ala Val Asp Trp lie Ala Gly Asn lie Tyr Trp Thr Asp 

1970 1975 1980 

Gin Gly Phe Asp Val lie Glu Val Ala Arg Leu Asn Gly Ser Phe Arg 
985 1990 1995 2000 

Tyr Val Val lie Ser Gin Gly Leu Asp Lys Pro Arg Ala lie Thr Val 

2005 2010 2015 

His Pro Glu Lys Gly Tyr Leu Phe Trp Thr Glu Trp Gly His Tyr Pro 

2020 2025 2030 

Arg lie Glu Arg Ser Arg Leu Asp Gly Thr Glu Arg Val Val Leu Val 

2035 2040 2045 

Asn Val Ser He Ser Trp Pro Asn Gly He Ser Val Asp Tyr Gin Gly 

2050 2055 2060 

Gly Lys Leu Tyr Trp Cys Asp Ala Arg Met Asp Lys He Glu Arg He 
065 2070 2075 2080 

Asp Leu Glu Thr Gly Glu Asn Arg Glu Val Val Leu Ser Ser Asn Asn 

2085 2090 2095 

Met Asp Met Phe Ser Val Ser Val Phe Glu Asp Phe He Tyr Trp Ser 

2100 2105 2110 

Asp Arg Thr His Ala Asn Gly Ser He Lys Arg Gly Cys Lys Asp Asn 

2115 2120 2125 

Ala Thr Asp Ser Val Pro Leu Arg Thr Gly He Gly Val Gin Leu Lys 

2130 2135 2140 

Asp He Lys Val Phe Asn Arg Asp Arg Gin Lys Gly Thr Asn Val Cys 
145 2150 2155 2160 

Ala Val Ala Asn Gly Gly Cys Gin Gin Leu Cys Leu Tyr Arg Gly Gly 

2165 2170 2175 

Gly Gin Arg Ala Cys Ala Cys Ala His Gly Met Leu Ala Glu Asp Gly 

2180 2185 2190 

Ala Ser Cys Arg Glu Tyr Ala Gly Tyr Leu Leu Tyr Ser Glu Arg Thr 

2195 2200 2205 

He Leu Lys Ser He His Leu Ser Asp Glu Arg Asn Leu Asn Ala Pro 

2210 2215 2220 

Val Gin Pro Phe Glu Asp Pro Glu His Met Lys Asn Val He Ala Leu 
225 2230 2235 2240 

Ala Phe Asp Tyr Arg Ala Gly Thr Ser Pro Gly Thr Pro Asn Arg He 

2245 2250 2255 

Phe Phe Ser Asp He His Phe Gly Asn He Gin Gin He Asn Asp Asp 

2260 2265 2270 

Gly Ser Gly Arg Thr Thr He Val Glu Asn Val Gly Ser Val Glu Gly 

2275 -2280 2285 

Leu Ala Tyr His Arg Gly Trp Asp Thr Leu Tyr Trp Thr Ser Tyr Thr 

2290 2295 2300 

Thr Ser Thr He Thr Arg His Thr Val Asp Gin Thr Arg Pro Gly Ala 
305 2310 2315 2320 

Phe Glu Arg Glu Thr Val He Thr Met Ser Gly Asp Asp His Pro Arg 

2325 2330 2335 

Ala Phe Val Leu Asp Glu Cys Gin Asn Leu Met Phe Trp Thr Asn Trp 
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2340 2345 2350 

Asn Glu Leu His Pro Ser lie Met Arg Ala Ala Leu Ser Gly Ala Asn 

2355 2360 * 2365 

Val Leu Thr Leu lie Glu Lys Asp lie Arg Thr Pro Asn Gly Leu Ala 

2370 2375 2380 

lie Asp His Arg Ala Glu Lys Leu Tyr Phe Ser Asp Ala Thr Leu Asp 
385 2390 2395 ' 2400 

Lys lie Glu Arg Cys Glu Tyr Asp Gly Ser His Arg Tyr Val He Leu 

2405 " 2410 " 2415 

Lys Ser Glu Pro Val His Pro Phe Gly Leu Ala Val Tyr Gly Glu His 

2420 2425 2430 

He Phe Trp Thr Asp Trp Val Arg Arg Ala Val Gin Arg Ala Asn Lys 

2435 2440 2445 

Tyr Val Gly Ser Asp Met Lys Leu Leu Arg Val Asp He Pro Gin Gin 

2450 2455 2460 

Pro Met Gly He He Ala Val Ala Asn Asp Thr Asn Ser Cys Glu Leu 
465 2470 2475 2480 

Ser Pro Cys Arg He Asn Asn Gly Gly Cys Gin Asp Leu Cys Leu Leu 

2485 2490 2495 

Thr His Gin Gly His Val Asn Cys Ser Cys Arg Gly Gly Arg He Leu 

2500 2505 ' 2510 

Gin Glu Asp Phe Thr Cys Arg Ala Val Asn Ser Ser Cys Arg Ala Gin 

2515 * 2520 2525 

Asp Glu Phe Glu Cys Ala Asn Gly Glu Cys lie Ser Phe Ser Leu Thr 

2530 2535 " " 2540 

Cys Asp Gly Val Ser His Cys Lys Asp Lys Ser Asp Glu Lys Pro Ser 
545 ' 2550 2555 2560 

Tyr Cys Asn Ser Arg Arg Cys Lys Lys Thr Phe Arg Gin Cys Asn Asn 

2565 2570 2575 

Gly Arg Cys Val Ser Asn Met Leu Trp Cys Asn Gly Val Asp Tyr Cys 

2580 2585 2590 

Gly Asp Gly Ser Asp Glu He Pro Cys Asn Lys Thr Ala Cys Gly Val 

2595 * 2600 " 2605 

Gly Glu Phe Arg Cys Arg Asp Gly Ser Cys He Gly Asn Ser Ser Arg 

2610 " 2615 2620 

Cys Asn Gin Phe Val Asp Cys Glu Asp Ala Ser Asp Glu Met Asn Cys 
625 2630 2635 2640 

Ser Ala Thr Asp Cys Ser Ser Tyr Phe Arg Leu Gly Val Lys Gly Val 

2645 2650 2655 

Leu Phe Gin Pro Cys Glu Arg Thr Ser Leu Cys Tyr Ala Pro Ser Trp 

2660 " 2665 2670 

Val Cys Asp Gly Ala Asn Asp Cys Gly Asp Tyr Ser Asp Glu Arg Asp 

2675 2680 2685 

Cys Pro Gly Val Lys Arg Pro Arg Cys Pro Leu Asn Tyr Phe Ala Cys 

2690 ~ 2695 2700 

Pro Ser Gly Arg Cys He Pro Met Ser Trp Thr Cys Asp Lys Glu Asp 
705 2710 2715 2720 

Asp Cys Glu Asn Gly Glu Asp Glu Thr His Cys Asn Lys Phe Cys Ser 

2725 2730 2735 

Glu Ala Gin Phe Glu Cys Gin Asn His Arg Cys He Ser Lys Gin Trp 

2740 2745 2750 

Leu Cys Asp Gly Ser Asp Asp Cys Gly Asp Gly Ser Asp Glu Ala Ala 

2755 * 2760 " ' 2765 

His Cys Glu Gly Lys Thr Cys Gly Pro Ser Ser Phe Ser Cys Pro Gly 

2770 2775 2780 

Thr His Val Cys Val Pro Glu Arg Trp Leu Cys Asp Gly Asp Lys Asp 
785 2790 2795 2800 

Cys Thr Asp Gly Ala Asp Glu Ser Val Thr Ala Gly Cys Leu Tyr Asn 
2805 2810 2815 
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Ser Thr Cys Asp Asp Arg Glu Phe Met Cys Gin Asn Arg Leu Cys lie 

2820 2825 2830 

Pro Lys His Phe Val Cys Asp His Asp Arg Asp Cys Ala Asp Gly Ser 

2835 2840 2845 

Asp Glu Ser Pro Glu Cys Glu Tyr Pro Thr Cys Gly Pro Asn Glu Phe 

2850 2855 2860 

Arg Cys Ala Asn Gly Arg Cys Leu Ser Ser Arg Gin Trp Glu Cys Asp 
865 2870 2875 2880 

Gly Glu Asn Asp Cys His Asp His Ser Asp Glu Ala Pro Lys Asn Pro 

2885 2890 2895 

His Cys Thr Ser Pro Glu His Lys Cys Asn Ala Ser Ser Gin Phe Leu 

2900 2905 2910 

Cys Ser Ser Gly Arg Cys Val Ala Glu Ala Leu Leu Cys Asn Gly Gin 

2915 2920 2925 

Asp Asp Cys Gly Asp Gly Ser Asp Glu Arg Gly Cys His Val Asn Glu 

2930 2935 2940 

Cys Leu Ser Arg Lys Leu Ser Gly Cys Ser Gin Asp Cys Glu Asp Leu 
945 2950 2955 2960 

Lys He Gly Phe Lys Cys Arg Cys Arg Pro Gly Phe Arg Leu Lys Asp 

2965 2970 2975 

Asp Gly Arg Thr Cys Ala Asp Leu Asp Glu Cys Ser Thr Thr Phe Pro 

2980 2985 2990 

Cys Ser Gin Leu Cys He Asn Thr His Gly Ser Tyr Lys Cys Leu Cys 

2995 3000 3005 

Val Glu Gly Tyr Ala Pro Arg Gly Gly Asp Pro Kis Ser Cys Lys Ala 

3010 3015 3020 

Val Thr Asp Glu Glu Pro Phe Leu He Phe Ala Asn Arg Tyr Tyr Leu 
025 3030 3035 3040 

Arg Lys Leu Asn Leu Asp Gly Ser Asn Tyr Thr Leu Leu Lys Gin Gly 

3045 3050 3055 

Leu Asn Asn Ala Val Ala Leu Ala Phe Asp Tyr Arg Glu Gin Met He 

3060 3065 3070 

Tyr Trp Thr Gly Val Thr Thr Gin Gly Ser Met He Arg Arg Met His 

3075 3080 3085 

Leu Asn Gly Ser Asn Val Gin Val Leu His Arg Thr Gly Leu Ser Asn 

3090 3095 3100 

Pro Asp Gly Leu Ala Val Asp Trp Val Gly Gly Asn Leu Tyr Trp Cys 
105 3110 3115 3120 

Asp Lys Gly Arg Asp Thr He Glu Val Ser Lys Leu Asn Gly Ala Tyr 

3125 3130 3135 

Arg Thr Val Leu Val Ser Ser Gly Leu Arg Glu Pro Arg Ala Leu Val 

3140 3145 3150 

Val Asp Val Gin Asn Gly Tyr Leu Tyr Trp Thr Asp Trp Gly Asp His 

3155 3160 3165 

Ser Leu He Gly Arg He Gly Met Asp Gly Ser Gly Arg Ser He He 

3170 3175 3180 

Val Asp Thr Lys He Thr Trp Pro Asn Gly Leu Thr Val Asp Tyr Val 
185 3190 3195 3200 

Thr Glu Arg He Tyr Trp Ala Asp Ala Arg Glu Asp Tyr He Glu Phe 

3205 3210 3215 

Ala Ser Leu Asp Gly Ser Asn Arg His Val Val Leu Ser Gin Asp He 

3220 3225 3230 

Pro His He Phe Ala Leu Thr Leu Phe Glu Asp Tyr Val Tyr Trp Thr 

3235 3240 3245 . 

Asp Trp Glu Thr Lys Ser He Asn Arg Ala His Lys Thr Thr Gly Ala 

3250 3255 3260 

Asn Lys Thr Leu Leu He Ser Thr Leu His Arg Pro Met Asp Leu His 
265 3270 3275 3280 

Val Phe His Ala Leu Arg Gin Pro Asp Val Pro Asn His Pro Cys Lys 
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3285 



3290 



3295 



Val Asn Asn Gly Gly Cys Ser Asn Leu Cys Leu Leu Ser Pro Gly Gly 

3300 3305 3310 

Gly His Lys Cys Ala Cys Pro Thr Asn Phe Tyr Leu Gly Gly Asp Gly 

3315 3320 3325 

Arg Thr Cys Val Ser Asn Cys Thr Ala Ser Gin Phe Val Cys Lys Asn 

3330 3335 3340 

Asp Lys Cys He Pro Phe Trp Trp Lys Cys Asp Thr Glu Asp Asp Cys 
345 3350 3355 3360 

Gly Asp His Ser Asp Glu Pro Pro Asp Cys Pro Glu Phe Lys Cys Arg 

3365 3370 3375 

Pro Gly Gin Phe Gin Cys Ser Thr Gly He Cys Thr Asn Pro Ala Phe 

3380 3385 3390 

He Cys Asp Gly Asp Asn Asp Cys Gin Asp Asn Ser Asp Glu Ala Asn 

3395 3400 3405 

Cys Asp He His Val Cys Leu Pro Ser Gin Phe Lys Cys Thr Asn Thr 

3410 3415 3420 

Asn Arg Cys He Pro Gly He Phe Arg Cys Asn Gly Gin Asp Asn Cys 
425 3430 3435 3440 

Gly Asp Gly Glu Asp Glu Arg Asp Cys Pro Glu Val Thr Cys Ala Pro 

3445 3450 3455 

Asn Gin Phe Gin Cys Ser He Thr Lys Arg Cys He Pro Arg Val Trp 

3460 3465 3470 

Val Cys Asp Arg Asp Asn His Cys Val Asp Gly Ser Asp Glu Pro Ala 

3475 3480 3485 

Asn Cys Thr Gin Met Thr Cys Gly Val Asp Glu Phe Arg Cys Lys Asp 

3490 3495 3500 

Ser Gly Arg Cys He Pro Ala Arg Trp Lys Cys Asp Gly Glu Asp Asp 
505 3510 3515 3520 

Cys Gly Asp Gly Ser Asp Glu Pro Lys Glu Glu Cys Asp Glu Arg Thr 

3525 3530 3535 

Cys Glu Pro Tyr Gin Phe Arg Cys Lys Asn Asn Arg Cys Val Pro Gly 

3540 3545 3550 

Arg Trp Gin Cys Asp Tyr Asp Asn Asp Cys Gly Asp Asn Ser Asp Glu 

3555 3560 3565 

Glu Ser Cys Thr Pro Arg Pro Cys Ser Glu Ser Glu Phe Phe Cys Ala 

3570 3575 3580 

Asn Gly Arg Cys He Ala Gly Arg Trp Lys Cys Asp Gly Asp Kis Asp 
585 3590 3595 3600 

Cys Ala Asp Gly Ser Asp Glu Lys Asp Cys Thr Pro Arg Cys Asp Met 

3605 3610 3615 

Asp Gin Phe Gin Cys Lys Ser Gly His Cys He Pro Leu Arg Trp Pro 

3620 3625 3630 

Cys Asp Ala Asp Ala Asp Cys Met Asp Gly Ser Asp Glu Glu Ala Cys 

3635 3640 3645 

Gly Thr Gly Val Arg Thr Cys Pro Leu Asp Glu Phe Gin Cys Asn Asn 

3650 3655 3660 

Thr Leu Cys Lys Pro Leu Ala Trp Lys Cys Asp Gly Glu Asp Asp Cys 
665 3670 3675 3680 

Gly Asp Asn Ser Asp Glu Asn Pro Glu Glu Cys Ala Arg Phe He Cys 

3685 3690 3695 

Pro Pro Asn Arg Pro Phe Arg Cys Lys Asn Asp Arg Val Cys Leu Trp 

3700 3705 3710 

He Gly Arg Gin Cys Asp Gly Val Asp Asn Cys Gly Asp Gly Thr Asp 

3715 3720 3725 

Glu Glu Asp Cys Glu Pro Pro Thr Ala Gin Asn Pro His Cys Lys Asp 

3730 3735 3740 

Lys Lys Glu Phe Leu Cys Arg Asn Gin Arg Cys Leu Ser Ser Ser Leu 
745 3750 3755 3760 
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Arg Cys Asn Met Phe Asp Asp Cys Gly Asp Gly Ser Asp Glu Glu Asp 

3765 3770 3775 

Cys Ser lie Asp Pro Lys Leu Thr Ser Cys Ala Thr Asn Ala Ser Met 

3780 3785 3790 

Cys Gly Asp Glu Ala Arg Cys Val Arg Thr Glu Lys Ala Ala Tyr Cys 

3795 3800 3805 

Ala Cys Arg Ser Gly Phe His Thr Val Pro Gly Gin Pro Gly Cys Gin 

3810 3815 3820 

Asp lie Asn Glu Cys Leu Arg Phe Gly Thr Cys Ser Gin Leu Trp Asn 
825 3830 3835 3840 

Lys Pro Lys Gly Gly His Leu Cys Ser Cys Ala Arg Asn Phe Met Lys 

3845 3850 3855 

Thr His Asn Thr Cys Lys Ala Glu Gly Ser Glu Tyr Gin Val Leu Tyr 

3860 3865 3870 

lie Ala Asp Asp Asn Glu lie Arg Ser Leu Phe Pro Gly His Pro His 

3875 3880 3885 

Ser Ala Tyr Glu Gin Thr Phe Gin Gly Asp Glu Ser Val Arg lie Asp 

3890 3895 3900 

Ala Met Asp Val His Val Lys Ala Gly Arg Val Tyr Trp Thr Asn Trp 
905 3910 3915 3920 

His Thr Gly Thr lie Ser Tyr Arg Ser Leu Pro Pro Ala Ala Pro Pro 

3925 3930 3935 

Thr Thr Ser Asn Arg His Arg Arg Gin He Asp Arg Gly Val Thr His 

3940 3945 3950 

Leu Asn He Ser Gly Leu Lys Met Pro Arg Gly He Ala He Asp Trp 

3955 3960 3965 

Val Ala Gly Asn Val Tyr Trp Thr Asp Ser Gly Arg Asp Val He Glu 

3970 3975 3980 

Val Ala Gin Met Lys Gly Glu Asn Arg Lys Thr Leu He Ser Gly Met 
985 3990 3995 4000 

He Asp Glu Pro His Ala He Val Val Asp Pro Leu Arg Gly Thr Met 

4005 4010 4015 

Tyr Trp Ser Asp Trp Gly Asn His Pro Lys He Glu Thr Ala Ala Met 

4020 4025 4030 

Asp Gly Thr Leu Arg Glu Thr Leu Val Gin Asp Asn He Gin Trp Pro 

4035 4040 * 4045 

Thr Gly Leu Ala Val Asp Tyr His Asn Glu Arg Leu Tyr Trp Ala Asp 

4050 4055 4060 

Ala Lys Leu Ser Val He Gly Ser He Arg Leu Asn Gly Thr Asp Pro 
065 4070 4075 4080 

He Val Ala Ala Asp Ser Lys Arg Gly Leu Ser His Pro Phe Ser He 

4085 4090 4095 

Asp Val Phe Glu Asp Tyr He Tyr Gly Val Thr Tyr He Asn Asn Arg 

4100 4105 4110 

Val Phe Lys He His Lys Phe Gly His Ser Pro Leu Tyr Asn Leu Thr 

4115 4120 4125 

Gly Gly Leu Ser His Ala Ser Asp Val Val Leu Tyr His Gin His Lys 

4130 4135 4140 

Gin Pro Glu Val Thr Asn Pro Cys Asp Arg Lys Lys Cys Glu Trp Leu 
145 4150 4155 4160 

Cys Leu Leu Ser Pro Ser Gly Pro Val Cys Thr Cys Pro Asn Gly Lys 

4165 4170 4175 

Arg Leu Asp Asn Gly Thr Cys Val Pro Val Pro Ser Pro Thr Pro Pro 

4180 4185 4190 

Pro Asp Ala Pro Arg Pro Gly Thr Cys Thr Leu Gin Cys Phe Asn Gly 

4195 4200 4205 

Gly Ser Cys Phe Leu Asn Ala Arg Arg Gin Pro Lys Cys Arg Cys Gin 

4210 4215 4220 

Pro Arg Tyr Thr Gly Asp Lys Cys Glu Leu Asp Gin Cys Trp Glu Tyr 
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225 



4230 



4235 



4240 



Cys His Asn Gly Gly Thr Cys Ala Ala Ser Pro Ser Gly Met Pro Thr 

4245 4250 4255 

Cys Arg Cys Pro Thr Gly Phe Thr Gly Pro Lys Cys Thr Ala Gin Val 

4260 4265 4270 

Cys Ala Gly Tyr Cys Ser Asn Asn Ser Thr Cys Thr Val Asn Gin Gly 

4275 4280 4285 

Asn Gin Pro Gin Cys Arg Cys Leu Pro Gly Phe Leu Gly Asp Arg Cys 

4290 4295 4300 

Gin Tyr Arg Gin Cys Ser Gly Phe Cys Glu Asn Phe Gly Thr Cys Gin 
305 4310 4315 4320 

Met Ala Ala Asp Gly Ser Arg Gin Cys Arg Cys Thr Val Tyr Phe Glu 

4325 4330 4335 

Gly Pro Arg Cys Glu Val Asn Lys Cys Ser Arg Cys Leu Gin Gly Ala 

4340 4345 4350 

Cys Val Val Asn Lys Gin Thr Gly Asp Val Thr Cys Asn Cys Thr Asp 

4355 4360 4365 

Gly Arg Val Ala Pro Ser Cys Leu Thr Cys lie Asp His Cys Ser Asn 

4370 4375 4380 

Gly Gly Ser Cys Thr Met Asn Ser Lys Met Met Pro Glu Cys Gin Cys 
385 4390 4395 4400 

Pro Pro His Met Thr Gly Pro Arg Cys Gin Glu Gin Val Val Ser Gin 

4405 4410 4415 

Gin Gin Pro Gly His Met Ala Ser lie Leu He Pro Leu Leu Leu Leu 

4420 4425 4430 

Leu Leu Leu Leu Leu Val Ala Gly Val Val Phe Trp Tyr Lys Arg Arg 

4435 4440 4445 

Val Arg Gly Ala Lys Gly Phe Gin His Gin Arg Met Thr Asn Gly Ala 

4450 4455 4460 

Met Asn Val Glu He Gly Asn Pro Thr Tyr Lys Met Tyr Glu Gly Gly 
465 4470 4475 4480 

Glu Pro Asp Asp Val Gly Gly Leu Leu Asp Ala Asp Phe Ala Leu Asp 

4485 4490 4495 

Pro Asp Lys Pro Thr Asn Phe Thr Asn Pro Val Tyr Ala Thr Leu Tyr 

4500 4505 4510 

Met Gly Gly His Gly Ser Arg His Ser Leu Ala Ser Thr Asp Glu Lys 

4515 4520 4525 

Arg Glu Leu Leu Gly Arg Gly Pro Glu Asp Glu He Gly Asp Pro Leu 
4530 4535 4540 

Ala 



545 
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GCTACAATCC ATCTGGTCTC CTCCAGCTCC TTCTTTCTGC AAC ATG GGG AAG AAC 

Met Gly Lys Asn 
1 

AAA CTC CTT CAT CCA AGT CTG GTT CTT CTC CTC TTG GTC CTC CTG CCC 
Lys Leu Leu His Pro Ser Leu Val Leu Leu Leu Leu Val Leu Leu Pro 
5 10 15 20 

ACA GAC GCC TCA GTC TCT GGA AAA CCG CAG TAT ATG GTT CTG GTC CCC 
Thr Asp Ala Ser Val Ser Gly Lys Pro Gin Tyr Met Val Leu Val Pro 
25 30 35 

TCC CTG CTC CAC ACT GAG ACC ACT GAG AAG GGC TGT GTC CTT CTG AGC 
Ser Leu Leu His Thr Glu Thr Thr Glu Lys Gly Cys Val Leu Leu Ser 
40 45 50 

TAC CTG AAT GAG ACA GTG ACT GTA AGT GCT TCC TTG GAG TCT GTC AGG 
Tyr Leu Asn Glu Thr Val Thr Val Ser Ala Ser Leu Glu Ser Val Arg 
55 60 65 

GGA AAC AGG AGC CTC TTC ACT GAC CTG GAG GCG GAG AAT GAC GTA CTC 
Gly Asn Arg Ser Leu Phe Thr Asp Leu Glu Ala Glu Asn Asp Val Leu 
70 75 80 

CAC TGT GTC GCC TTC GCT GTC CCA AAG TCT TCA TCC AAT GAG GAG GTA 
His Cys Val Ala Phe Ala Val Pro Lys Ser Ser Ser Asn Glu Glu Val 
85 90 95 100 

ATG TTC CTC ACT GTC CAA GTG AAA GGA CCA ACC CAA GAA TTT AAG AAG 
Met Phe Leu Thr Val Gin Val Lys Gly Pro Thr Gin Glu Phe Lys Lys 
105 110 115 

CGG ACC ACA GTG ATG GTT AAG AAC GAG GAC AGT CTG GTC TTT GTC CAG 
Arg Thr Thr Val Met Val Lys Asn Glu Asp Ser Leu Val Phe Val Gin 
120 125 130 

ACA GAC AAA TCA ATC TAC AAA CCA GGG CAG ACA GTG AAA TTT CGT GTT 
Thr Asp Lys Ser He Tyr Lys Pro Gly Gin Thr Val Lys Phe Arg Val 
135 140 145 

GTC TCC ATG GAT GAA AAC TTT CAC CCC CTG AAT GAG TTG ATT CCA CTA 
Val Ser Met Asp Glu Asn Phe His Pro Leu Asn Glu Leu lie Pro Leu 
150 155 160 

GTA TAC ATT CAG GAT CCC AAA GGA AAT CGC ATC GCA CAA TGG CAG AGT 
Val Tyr He Gin Asp Pro Lys Gly Asn Arg He Ala Gin Trp Gin Ser 
165 170 175 180 

TTC CAG TTA GAG GGT GGC CTC AAG CAA TTT TCT TTT CCC CTC TCA TCA 
Phe Gin Leu Glu Gly Gly Leu Lys Gin Phe Ser Phe Pro Leu Ser Ser 
185 190 195 

GAG CCC TTC CAG GGC TCC TAC AAG GTG GTG GTA CAG AAG AAA TCA GGT 
Glu Pro Phe Gin Gly Ser Tyr Lys Val Val Val Gin Lys Lys Ser Gly 
200 205 210 

GGA AGG ACA GAG CAC CCT TTC ACC GTG GAG GAA TTT GTT CTT CCC AAG 
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Gly Arg Thr Glu His Pro Phe Thr Val Glu Glu Phe Val Leu Pro Lys 
215 220 225 

TTT GAA GTA CAA GTA ACA GTG CCA AAG ATA ATC ACC ATC TTG GAA GAA 77 5 

Phe Glu Val Gin Val Thr Val Pro Lys lie lie Thr He Leu Glu Glu 
230 235 240 

GAG ATG AAT GTA TCA GTG TGT GGC CTA TAC ACA TAT GGG AAG CCT GTC 823 
Glu Met Asn Val Ser Val Cys Gly Leu Tyr Thr Tyr Gly Lys Pro Val 
245 250 255 ~ 260 

CCT GGA CAT GTG ACT GTG AGC ATT TGC AGA AAG TAT AGT GAC GCT TCC 871 
Pro Gly His Val Thr Val Ser He Cys Arg Lys Tyr Ser Asp Ala Ser 
265 270 " 275 

GAC TGC CAC GGT GAA GAT TCA CAG GCT TTC TGT GAG AAA TTC AGT GGA 919 
Asp Cys His Gly Glu Asp Ser Gin Ala Phe Cys Glu Lys Phe Ser Gly 
280 285 '" 290 

CAG CTA AAC AGC CAT GGC TGC TTC TAT CAG CAA GTA AAA ACC AAG GTC 967 
Gin Leu Asn Ser His Gly Cys Phe Tyr Gin Gin Val Lys Thr Lys Val 
295 300 305 

TTC CAG CTG AAG AGG AAG GAG TAT GAA ATG AAA CTT CAC ACT GAG GCC 1015 
Phe Gin Leu Lys Arg Lys Glu Tyr Glu Met Lys Leu His Thr Glu Ala 
310 315 320 

CAG ATC CAA GAA GAA GGA ACA GTG GTG GAA TTG ACT GGA AGG CAG TCC 1063 
Gin He Gin Glu Glu Gly Thr Val Val Glu Leu Thr Gly Arg Gin Ser 
325 330 335 340 

AGT GAA ATC ACA AGA ACC ATA ACC AAA CTC TCA TTT GTG AAA GTG GAC 1111 
Ser Glu He Thr Arg Thr He Thr Lys Leu Ser Phe Val Lys Val Asp 
345 350 355 

TCA CAC TTT CGA CAG GGA ATT CCC TTC TTT GGG CAG GTG CGC CTA GTA 1159 
Ser His Phe Arg Gin Gly He Pro Phe Phe Gly Gin Val Arg Leu Val 
360 365 370 

GAT GGG AAA GGC GTC CCT ATA CCA AAT AAA GTC ATA TTC ATC AGA GGA 1207 
Asp Gly Lys Gly Val Pro He Pro Asn Lys Val He Phe He Arg Gly 
375 380 385 

AAT GAA GCA AAC TAT TAC TCC AAT GCT ACC ACG GAT GAG CAT GGC CTT 1255 
Asn Glu Ala Asn Tyr Tyr Ser Asn Ala Thr Thr Asp Glu His Gly Leu 
390 395 400 

GTA CAG TTC TCT ATC AAC ACC ACC AAC GTT ATG GGT ACC TCT CTT ACT 1303 
Val Gin Phe Ser He Asn Thr Thr Asn Val Met Gly Thr Ser Leu Thr 
405 410 415 420 

GTT AGG GTC AAT TAC AAG GAT CGT AGT CCC TGT TAC GGC TAC CAG TGG 1351 
Val Arg Val Asn Tyr Lys Asp Arg Ser Pro Cys Tyr Gly Tyr Gin Trp 
425 " 430 " . 435 

GTG TCA GAA GAA CAC GAA GAG GCA CAT CAC ACT GCT TAT CTT GTG TTC 1399 
Val Ser Glu Glu His Glu Glu Ala His His Thr Ala Tyr Leu Val Phe 
440 445 450 
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TCC CCA AGC AAG AGC TTT GTC CAC CTT GAG CCC ATG TCT CAT GAA CTA 

Ser Pro Ser Lys Ser Phe Val His Leu Glu Pro Met Ser His Glu Leu 
455 460 465 

CCC TGT GGC CAT ACT CAG ACA GTC CAG GCA CAT TAT ATT CTG AAT GGA 

Pro Cys Gly His Thr Gin Thr Val Gin Ala His Tyr He Leu Asn Gly 
470 475 480 

GGC ACC CTG CTG GGG CTG AAG AAG CTC TCC TTT TAT TAT CTG ATA ATG 

Gly Thr Leu Leu Gly Leu Lys Lys Leu Ser Phe Tyr Tyr Leu He Met 
485 490 495 500 

GCA AAG GGA GGC ATT GTC CGA ACT GGG ACT CAT GGA CTG CTT GTG AAG 

Ala Lys Gly Gly He Val Arg Thr Gly Thr His Gly Leu Leu Val Lys 

505 510 515 

CAG GAA GAC ATG AAG GGC CAT TTT TCC ATC TCA ATC CCT GTG AAG TCA 

Gin Glu Asp Met Lys Gly His Phe Ser He Ser He Pro Val Lys Ser 
520 525 530 

GAC ATT GCT CCT GTC GCT CGG TTG CTC ATC TAT GCT GTT TTA CCT ACC 

Asp He Ala Pro Val Ala Arg Leu Leu lie Tyr Ala Val Leu Pro Thr 
535 540 545 

GGG GAC GTG ATT GGG GAT TCT GCA AAA TAT GAT GTT GAA AAT TGT CTG 

Gly Asp Val He Gly Asp Ser Ala Lys Tyr Asp Val Glu Asn Cys Leu 
550 555 560 

GCC AAC AAG GTG GAT TTG AGC TTC AGC CCA TCA CAA AGT CTC CCA GCC 

Ala Asn Lys Val Asp Leu Ser Phe Ser Pro Ser Gin Ser Leu Pro Ala 
565 570 575 580 

TCA CAC GCC CAC CTG CGA GTC ACA GCG GCT CCT CAG TCC GTC TGC GCC 

Ser His Ala His Leu Arg Val Thr Ala Ala Pro Gin Ser Val Cys Ala 

585 590 595 

CTC CGT GCT GTG GAC CAA AGC GTG CTG CTC ATG AAG CCT GAT GCT GAG 

Leu Arg Ala Val Asp Gin Ser Val Leu Leu Met Lys Pro Asp Ala Glu 
600 605 610 

CTC TCG GCG TCC TCG GTT TAC AAC CTG CTA CCA GAA AAG GAC CTC ACT 

Leu Ser Ala Ser Ser Val Tyr Asn Leu Leu Pro Glu Lys Asp Leu Thr 
615 620 625 

GGC TTC CCT GGG CCT TTG AAT GAC CAG GAC GAT GAA GAC TGC ATC AAT 

Gly Phe Pro Gly Pro Leu Asn Asp Gin Asp Asp Glu Asp Cys He Asn 
630 635 640 

CGT CAT AAT GTC TAT ATT AAT GGA ATC ACA TAT ACT CCA GTA TCA AGT 

Arg His Asn Val Tyr He Asn Gly He Thr Tyr Thr Pro Val Ser Ser 
645 650 655 660 

ACA AAT GAA AAG GAT ATG TAC AGC TTC CTA GAG GAC ATG GGC TTA AAG 

Thr Asn Glu Lys Asp Met Tyr Ser Phe Leu Glu Asp Met Gly Leu Lys 

665 670 675 

GCA TTC ACC AAC TCA AAG ATT CGT AAA CCC AAA ATG TGT CCA CAG CTT 

Ala Phe Thr Asn Ser Lys He Arg Lys Pro Lys Met Cys Pro Gin Leu 
680 685 690 
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CAA CAG TAT GAA ATG CAT GGA CCT GAA GGT CTA CGT GTA GGT TTT TAT 2167 
Gin Gin Tyr Glu Met His Gly Pro Glu Gly Leu Arg Val Gly Phe Tvr 
695 700 705 

GAG TCA GAT GTA ATG GGA AGA GGC CAT GCA CGC CTG GTG CAT GTT GAA 2215 
Glu Ser Asp Val Met Gly Arg Gly His Ala Arg Leu Val His Val Glu 
710 715 720 

GAG CCT CAC ACG GAG ACC GTA CGA AAG TAC TTC CCT GAG ACA TGG ATC 22 63 
Glu Pro His Thr Glu Thr Val Arg Lys Tyr Phe Pro Glu Thr Trp He 
725 730 735 P 740 

TGG GAT TTG GTG GTG GTA AAC TCA GCA GGG GTG GCT GAG GTA GGA GTA 2311 
Trp Asp Leu Val Val Val Asn Ser Ala Gly Val Ala Glu Val Gly Val 
745 750 755 

ACA GTC CCT GAC ACC ATC ACC GAG TGG AAG GCA GGG GCC TTC TGC CTG 2359 
Thr Val Pro Asp Thr He Thr Glu Trp Lys Ala Gly Ala Phe Cys Leu 
750 765 770 

TCT GAA GAT GCT GGA CTT GGT ATC TCT TCC ACT GCC TCT CTC CGA GCC 24 07 
Ser Glu Asp Ala Gly Leu Gly He Ser Ser Thr Ala Ser Leu Arq Ala 
775 780 785 

TTC CAG CCC TTC TTT GTG GAG CTT ACA ATG CCT TAC TCT GTG ATT CGT 2455 
Phe Gin Pro Phe Phe Val Glu Leu Thr Met Pro Tyr Ser Val He Arg 
790 795 800 

GGA GAG GCC TTC ACA CTC AAG GCC ACG GTC CTA AAC TAC CTT CCC AAA 2 503 

Gly Glu Ala Phe Thr Leu Lys Ala Thr Val Leu Asn Tyr Leu Pro Lys 
805 810 815 820 

TGC ATC CGG GTC AGT GTG CAG CTG GAA GCC TCT CCC GCC TTC CTT GCT 2551 
Cys He Arg Val Ser Val Gin Leu Glu Ala Ser Pro Ala Phe Leu Ala 
825 830 835 

GTC CCA GTG GAG AAG GAA CAA GCG CCT CAC TGC ATC TGT GCA AAC GGG 25 99 
Val Pro Val Glu Lys Glu Gin Ala Pro His Cys He Cys Ala Asn Gly 
840 845 850 

CGG CAA ACT GTG TCC TGG GCA GTA ACC CCA AAG TCA TTA GGA AAT GTG 2647 
Arg Gin Thr Val Ser Trp Ala Val Thr Pro Lys Ser Leu Gly Asn Val 
855 860 865 

AAT TTC ACT GTG AGC GCA GAG GCA CTA GAG TCT CAA GAG CTG TGT GGG 2 695 
Asn Phe Thr Val Ser Ala Glu Ala Leu Glu Ser Gin Glu Leu Cys Gly 
870 875 880 

ACT GAG GTG CCT TCA GTT CCT GAA CAC GGA AGG AAA GAC ACA GTC ATC 27 43 
Thr Glu Val Pro Ser Val Pro Glu His Gly Arg Lys Asp Thr Val He 
885 890 895 900 

AAG CCT CTG TTG GTT GAA CCT GAA GGA CTA GAG AAG GAA ACA ACA TTC 27 91 
Lys Pro Leu Leu Val Glu Pro Glu Gly Leu Glu Lys Glu Thr Thr Phe 
905 910 ' 915 

AAC TCC CTA CTT TGT CCA TCA GGT GGT GAG GTT TCT GAA GAA TTA TCC 2839 
Asn Ser Leu Leu Cys Pro Ser Gly Gly Glu Val Ser Glu Glu Leu Ser 
920 925 930 
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CTG AAA. CTG CCA CCA AAT GTG GTA GAA GAA TCT GCC CGA GCT TCT GTC 
Leu Lys Leu Pro Pro Asn Val Val Glu Glu Ser Ala Arg Ala Ser Val 
935 940 945 

TCA GTT TTG GGA GAC ATA TTA GGC TCT GCC ATG CAA AAC ACA CAA AAT 
Ser Val Leu Gly Asp lie Leu Gly Ser Ala Met Gin Asn Thr Gin Asn 
950 955 960 

CTT CTC CAG ATG CCC TAT GGC TGT GGA GAG CAG AAT ATG GTC CTC TTT 
Leu Leu Gin Met Pro Tyr Gly Cys Gly Glu Gin Asn Met Val Leu Phe 
965 970 975 980 

GCT CCT AAC ATC TAT GTA CTG GAT TAT CTA AAT GAA ACA CAG CAG CTT 
Ala Pro Asn lie Tyr Val Leu Asp Tyr Leu Asn Glu Thr Gin Gin Leu 
985 990 995 

ACT CCA GAG GTC AAG TCC AAG GCC ATT GGC TAT CTC AAC ACT GGT TAC 
Thr Pro Glu Val Lys Ser Lys Ala lie Gly Tyr Leu Asn Thr Gly Tyr 
1000 1005 ' 1010 

CAG AGA CAG TTG AAC TAC AAA CAC TAT GAT GGC TCC TAC AGC ACC TTT 
Gin Arg Gin Leu Asn Tyr Lys His Tyr Asp Gly Ser Tyr Ser Thr Phe 
1015 1020 * 1025 

GGG GAG CGA TAT GGC AGG AAC CAG GGC AAC ACC TGG CTC ACA GCC TTT 
Gly Glu Arg Tyr Gly Arg Asn Gin Gly Asn Thr Trp Leu Thr Ala Phe 
1030 1035 1040 

GTT CTG AAG ACT TTT GCC CAA GCT CGA GCC TAC ATC TTC ATC GAT GAA 
Val Leu Lys Thr Phe Ala Gin Ala Arg Ala Tyr He Phe He Asp Glu 
1045 1050 1055 1060 

GCA CAC ATT ACC CAA GCC CTC ATA TGG CTC TCC CAG AGG CAG AAG GAC 
Ala His He Thr Gin Ala Leu He Trp Leu Ser Gin Arg Gin Lys Asp 
1065 1070 1075 

AAT GGC TGT TTC AGG AGC TCT GGG TCA CTG CTC AAC AAT GCC ATA AAG 
Asn Gly Cys Phe Arg Ser Ser Gly Ser Leu Leu Asn Asn Ala He Lys 
1080 1085 1090 

GGA GGA GTA GAA GAT GAA GTG ACC CTC TCC GCC TAT ATC ACC ATC GCC 
Gly Gly Val Glu Asp Glu Val Thr Leu Ser Ala Tyr He Thr He Ala 
1095 1100 1105 

CTT CTG GAG ATT CCT CTC ACA GTC ACT CAC CCT GTT GTC CGC AAT GCC 
Leu Leu Glu He Pro Leu Thr Val Thr His Pro Val Val Arg Asn Ala 
1110 1115 1120 

CTG TTT TGC CTG GAG TCA GCC TGG AAG ACA GCA CAA GAA GGG GAC CAT 
Leu Phe Cys Leu Glu Ser Ala Trp Lys Thr Ala Gin Glu Gly Asp His 
1125 1130 1135 1140 

GGC AGC CAT GTA TAT ACC AAA GCA CTG CTG GCC TAT GCT TTT GCC CTG 
Gly Ser His Val Tyr Thr Lys Ala Leu Leu Ala Tyr Ala Phe Ala Leu 
1145 1150 ' 1155 



GCA GGT AAC CAG GAC AAG AGG AAG GAA GTA CTC AAG TCA CTT AAT GAG 
Ala Gly Asn Gin Asp Lys Arg Lys Glu Val Leu Lys Ser Leu Asn Glu 
1160 1165 1170 
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GAA GCT GTG AAG AAA GAC AAC TCT GTC CRT TGG GAG CGC CCT CAG AAA 3607 
Glu Ala Val Lys Lys Asp Asn Ser Val His Trp Glu Arg Pro Gin Lys 
1175 1180 * 1185 

CCC AAG GCA CCA GTG GGG CAT TTT TAC GAA CCC CAG GCT CCC TCT GCT 3655 
Pro Lys Ala Pro Val Gly His Phe Tyr Glu Pro Gin Ala Pro Ser Ala 
1190 1195 1200 

GAG GTG GAG ATG ACA TCC TAT GTG CTC CTC GCT TAT CTC ACG GCC CAG 3703 
Glu Val Glu Met Thr Ser Tyr Val Leu Leu Ala Tyr Leu Thr Ala Gin 
1205 1210 1215 1220 

CCA GCC CCA ACC TCG GAG GAC CTG ACC TCT GCA ACC AAC ATC GTG AAG 3751 
Pro Ala Pro Thr Ser Glu Asp Leu Thr Ser Ala Thr Asn lie Val Lys 
1225 1230 1235 

TGG ATC ACG AAG CAG CAG AAT GCC CAG GGC GGT TTC TCC TCC ACC CAG 3799 
Trp lie Thr Lys Gin Gin Asn Ala Gin Gly Gly Phe Ser Ser Thr Gin 
1240 1245 " " 1250 

GAC ACA GTG GTG GCT CTC CAT GCT CTG TCC AAA TAT GGA GCC GCC ACA 384 7 
Asp Thr Val Val Ala Leu His Ala Leu Ser Lys Tyr Gly Ala Ala Thr 
1255 1260 1265 

TTT ACC AGG ACT GGG AAG GCT GCA CAG GTG ACT ATC CAG TCT TCA GGG 3895 
Phe Thr Arg Thr Gly Lys Ala Ala Gin Val Thr He Gin Ser Ser Gly 
1270 1275 1280 

ACA TTT TCC AGC AAA TTC CAA GTG GAC AAC AAC AAT CGC CTG TTA CTG 394 3 

Thr Phe Ser Ser Lys Phe Gin Val Asp Asn Asn Asn Arg Leu Leu Leu 
1285 1290 * 1295 ' 1300 

CAG CAG GTC TCA TTG CCA GAG CTG CCT GGG GAA TAC AGC ATG AAA GTG 3 9 91 

Gin Gin Val Ser Leu Pro Glu Leu Pro Gly Glu Tyr Ser Met Lys Val 
1305 1310 1315 

ACA GGA GAA GGA TGT GTC TAC CTC CAG ACC TCC TTG AAA TAC AAT ATT 4 03 9 
Thr Gly Glu Gly Cys Val Tyr Leu Gin Thr Ser Leu Lys Tyr Asn He 
1320 1325 1330 

CTC CCA GAA AAG GAA GAG TTC CCC TTT GCT TTA GGA GTG CAG ACT CTG 4 087 
Leu Pro Glu Lys Glu Glu Phe Pro Phe Ala Leu Gly Val Gin Thr Leu 
1335 1340 1345 

CCT CAA ACT TGT GAT GAA CCC AAA GCC CAC ACC AGC TTC CAA ATC TCC 4135 
Pro Gin Thr Cys Asp Glu Pro Lys Ala His Thr Ser Phe Gin He Ser 
1350 * 1355 1360 

CTA AGT GTC AGT TAC ACA GGG AGC CGC TCT GCC TCC AAC ATG GCG ATC 4183 
Leu Ser Val Ser Tyr Thr Gly Ser Arg Ser Ala Ser Asn Met Ala He 
1365 1370 1375 1380 

GTT GAT GTG AAG ATG GTC TCT GGC TTC ATT CCC CTG AAG CCA ACA GTG 4231 
Val Asp Val Lys Met Val Ser Gly Phe He Pro Leu Lys Pro Thr Val 
1385 1390 1395 

AAA ATG CTT. GAA AGA TCT AAC CAT GTG AGC CGG ACA GAA GTC AGC AGC 4279 
Lys Met Leu Glu Arg Ser Asn His Val Ser Arg Thr Glu Val Ser Ser 
1400 1405 ' 1410 
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AAC CAT GTC TTG ATT TAC CTT GAT AAG GTG TCA AAT CAG ACA CTG AGC 4 327 
Asn His Val Leu lie Tyr Leu Asp Lys Val Ser Asn Gin Thr Leu Ser 
1415 1420 1425 

TTG TTC TTC ACG GTT CTG CAA GAT GTC CCA GTA AGA GAT CTC AAA CCA 4 37 5 
Leu Phe Phe Thr Val Leu Gin Asp Val Pro Val Arg Asp Leu Lys Pro 
1430 1435 1440 

GCC ATA GTG AAA GTC TAT GAT TAC TAC GAG ACG GAT GAG TTT GCA ATC 4 4 23 
Ala lie Val Lys Val Tyr Asp Tyr Tyr Glu Thr Asp Glu Phe Ala lie 
1445 1450 1455 1460 

GCT GAG TAC AAT GCT CCT TGC AGC AAA GAT CTT GGA AAT GCT TGAAGACCA 4474 
Ala Glu Tyr Asn Ala Pro Cys Ser Lys Asp Leu Gly Asn Ala 
1465 1470 1 

CAAGGCTGAA AAGTGCTTTG CTGGAGTCCT GTTCTCTGAG CTCCACAGAA GACACGTGTT 4 534 
TTTGTATCTT TAAAGACTTG ATGAATAAAC ACTTTTTCTG GTC 4 577 
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Ser Val Ser Gly Lys Pro Gin Tyr Met Val Leu Val Pro Ser Leu Leu 

15 10 15 

His Thr Glu Thr Thr Glu Lys Gly Cys Val Leu Leu Ser Tyr Leu Asn 

20 25 30 

Glu Thr Val Thr Val Ser Ala Ser Leu Glu Ser Val Arg Gly Asn Arg 

35 40 45 

Ser Leu Phe Thr Asp Leu Glu Ala Glu Asn Asp Val Leu His Cys Val 

50 55 60 

Ala Phe Ala Val Pro Lys Ser Ser Ser Asn Glu Glu Val Met Phe Leu 
65 70 75 80 

Thr Val Gin Val Lys Gly Pro Thr Gin Glu Phe Lys Lys Arg Thr Thr 

85 90 95 

Val Met Val Lys Asn Glu Asp Ser Leu Val Phe Val Gin Thr Asp Lys 

100 105 110 

Ser He Tyr Lys Pro Gly Gin Thr Val Lys Phe Arg Val Val Ser Met 

115 120 125 

Asp Glu Asn Phe His Pro Leu Asn Glu Leu He Pro Leu Val Tyr He 

130 135 140 

Gin Asp Pro Lys Gly Asn Arg He Ala Gin Trp Gin Ser Phe Gin Leu 
145 150 155 160 

Glu Gly Gly Leu Lys Gin Phe Ser Phe Pro Leu Ser Ser Glu Pro Phe 

165 170 175 

Gin Gly Ser Tyr Lys Val Val Val Gin Lys Lys Ser Gly Gly Arg Thr 

180 185 190 

Glu His Pro Phe Thr Val Glu Glu Phe Val Leu Pro Lys Phe Glu Val 

195 200 205 

Gin Val Thr Val Pro Lys He He Thr He Leu Glu Glu Glu Met Asn 

210 215 220 

Val Ser Val Cys Gly Leu Tyr Thr Tyr Gly Lys Pro Val Pro Gly His 
225 230 235 240 

Val Thr Val Ser He Cys Arg Lys Tyr Ser Asp Ala Ser Asp Cys His 

245 250 255 

Gly Glu Asp Ser Gin Ala Phe Cys Glu Lys Phe Ser Gly Gin Leu Asn 

260 265 270 

Ser His Gly Cys Phe Tyr Gin Gin Val Lys Thr Lys Val Phe Gin Leu 

275 280 285 

Lys Arg Lys Glu Tyr Glu Met Lys Leu His Thr Glu Ala Gin He Gin 

290 295 300 

Glu Glu Gly Thr Val Val Glu Leu Thr Gly Arg Gin Ser Ser Glu He 
305 310 315 320 

Thr Arg Thr He Thr Lys Leu Ser Phe Val Lys Val Asp Ser His Phe 

325 330 335 

Arg Gin Gly He Pro Phe Phe Gly Gin Val Arg Leu Val Asp Gly Lys 

340 345 350 

Gly Val Pro He Pro Asn Lys Val He Phe He Arg Gly Asn Glu Ala 

355 360 365 

Asn Tyr Tyr Ser Asn Ala Thr Thr Asp Glu His Gly Leu Val Gin Phe 

370 375 380 

Ser He Asn Thr Thr Asn Val Met Gly Thr Ser Leu Thr Val Arg Val 
385 390 395 400 

Asn Tyr Lys Asp Arg Ser Pro Cys Tyr Gly Tyr Gin Trp Val Ser Glu 

405 .410 415 

Glu His Glu Glu Ala His His Thr Ala Tyr Leu Val Phe Ser Pro Ser 

420 425 430 

Lys Ser Phe Val His Leu Glu Pro Met Ser His Glu Leu Pro Cys Gly 

435 440 445 

His Thr Gin Thr Val Gin Ala His Tyr He Leu Asn Gly Gly Thr Leu 

450 455 460 

Leu Gly Leu Lys Lys Leu Ser Phe Tyr Tyr Leu He Met Ala Lys Gly 
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465 



470 



475 



480 



Gly lie Val Arg Thr Gly Thr His Gly Leu Leu Val Lys Gin Glu Asp 

485 490 " 495 

Met Lys Gly His Phe Ser lie Ser lie Pro Val Lys Ser Asp lie Ala 

500 505 510 

Pro Val Ala Arg Leu Leu lie Tyr Ala Val Leu Pro Thr Gly Asp Val 

515 520 525 

lie Gly Asp Ser Ala Lys Tyr Asp Val Glu Asn Cys Leu Ala Asn Lys 

530 535 540 

Val Asp Leu Ser Phe Ser Pro Ser Gin Ser Leu Pro Ala Ser His Ala 
545 550 555 560 

His Leu Arg Val Thr Ala Ala Pro Gin Ser Val Cys Ala Leu Arg Ala 

565 570 575 

Val Asp Gin Ser Val Leu Leu Met Lys Pro Asp Ala Glu Leu Ser Ala 

580 585 590 

Ser Ser Val Tyr Asn Leu Leu Pro Glu Lys Asp Leu Thr Gly Phe Pro 

595 600 * 605 . 

Gly Pro Leu Asn Asp Gin Asp Asp Glu Asp Cys lie Asn Arg His Asn 

610 615 * 620 

Val Tyr He Asn Gly He Thr Tyr Thr Pro Val Ser Ser Thr Asn Glu 
625 630 635 640 

Lys Asp Met Tyr Ser Phe Leu Glu Asp Met Gly Leu Lys Ala Phe Thr 

645 650 " ' 655 

Asn Ser Lys He Arg Lys Pro Lys Met Cys Pro Gin Leu Gin Gin Tyr 

660 665 670 

Glu Met His Gly Pro Glu Gly Leu Arg Val Gly Phe Tyr Glu Ser Asp 

675 680 " 685 

Val Met Gly Arg Gly His Ala Arg Leu Val His Val Glu Glu Pro His 

690 "* 695 ' 700 

Thr Glu Thr Val Arg Lys Tyr Phe Pro Glu Thr Tro He Trp Asp Leu 
705 710 715 " 720 

Val Val Val Asn Ser Ala Gly Val Ala Glu Val Gly Val Thr Val Pro 

725 730 " 735 

Asp Thr He Thr Glu Trp Lys Ala Gly Ala Phe Cys Leu Ser Glu Asp 

740 " 745 750 

Ala Gly Leu Gly He Ser Ser Thr Ala Ser Leu Arg Ala Phe Gin Pro 

755 ~ 760 ' 765 

Phe Phe Val Glu Leu Thr Met Pro Tyr Ser Val He Arg Gly Glu Ala 

770 775 780 

Phe Thr Leu Lys Ala Thr Val Leu Asn Tyr Leu Pro Lys Cys He Arg 
785 790 795 800 

Val Ser Val Gin Leu Glu Ala Ser Pro Ala Phe Leu Ala Val Pro Val 

805 810 815 

Glu Lys Glu Gin Ala Pro His Cys He Cys Ala Asn Gly Arg Gin Thr 

820 825 830 

Val Ser Trp Ala Val Thr Pro Lys Ser Leu Gly Asn Val Asn Phe Thr 

835 840 ~ 845 

Val Ser Ala Glu Ala Leu Glu Ser Gin Glu Leu Cys Gly Thr Glu Val 

850 855 860 

Pro Ser Val Pro Glu His Gly Arg Lys Asp Thr Val He Lys Pro Leu 
865 870 875 880 

Leu Val Glu Pro Glu Gly Leu Glu Lys Glu Thr Thr Phe Asn Ser Leu 

885 890 895 

Leu Cys Pro Ser Gly Gly Glu Val Ser Glu Glu Leu Ser Leu Lys Leu 

900 905 910 

Pro Pro Asn Val Val Glu Glu Ser Ala Arg Ala Ser Val Ser Val Leu 

915 920 925 

Gly Asp He Leu Gly Ser Ala Met Gin Asn Thr Gin Asn Leu Leu Gin 
930 935 940 
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Met Pro Tyr Gly Cys Gly Glu Gin Asn Met Val Leu Phe Ala Pro Asn 
945 950 955 960 

lie Tyr Val Leu Asp Tyr Leu Asn Glu Thr Gin Gin Leu Thr Pro Glu 

965 970 975 

Val Lys Ser Lys Ala lie Gly Tyr Leu Asn Thr Gly Tyr Gin Arg Gin 

980 985 990 

Leu Asn Tyr Lys His Tyr Asp Gly Ser Tyr Ser Thr Phe Gly Glu Arg 

995 1000 1005 

Tyr Gly Arg Asn Gin Gly Asn Thr Trp Leu Thr Ala Phe Val Leu Lys 

1010 1015 1020 

Thr Phe Ala Gin Ala Arg Ala Tyr lie Phe lie Asp Glu Ala His He 
025 1030 1035 1040 

Thr Gin Ala Leu lie Trp Leu Ser Gin Arg Gin Lys Asp Asn Gly Cys 

1045 1050 1055 

Phe Arg Ser Ser Gly Ser Leu Leu Asn Asn Ala He Lys Gly Gly Val 

1060 1065 1070 

Glu Asp Glu Val Thr Leu Ser Ala Tyr He Thr He Ala Leu Leu Glu 

1075 1080 1085 

He Pro Leu Thr Val Thr His Pro Val Val Arg Asn Ala Leu Phe Cys 

1090 1095 1100 

Leu Glu Ser Ala Trp Lys Thr Ala Gin Glu Gly Asp His Gly Ser His 
105 1110 1115 1120 

Val Tyr Thr Lys Ala Leu Leu Ala Tyr Ala Phe Ala Leu Ala Gly Asn 

1125 1130 1135 

Gin Asp Lys Arg Lys Glu Val Leu Lys Ser Leu Asn Glu Glu Ala Val 

1140 1145 1150 

Lys Lys Asp Asn Ser Val His Trp Glu Arg Pro Gin Lys Pro Lys Ala 

1155 1160 1165 

Pro Val Gly His Phe Tyr Glu Pro Gin Ala Pro Ser Ala Glu Val Glu 

1170 1175 1180 

Met Thr Ser Tyr Val Leu Leu Ala Tyr Leu Thr Ala Gin Pro Ala Pro 
185 1190 1195 1200 

Thr Ser Glu Asp Leu Thr Ser Ala Thr Asn He Val Lys Trp He Thr 

1205 1210 1215 

Lys Gin Gin Asn Ala Gin Gly Gly Phe Ser Ser Thr Gin Asp Thr Val 

1220 1225 1230 

Val Ala Leu His Ala Leu Ser Lys Tyr Gly Ala Ala Thr Phe Thr Arg 

1235 1240 1245 

Thr Gly Lys Ala Ala Gin Val Thr He Gin Ser Ser Gly Thr Phe Ser 

1250 1255 1260 

Ser Lys Phe Gin Val Asp Asn Asn Asn Arg Leu Leu Leu Gin Gin Val 
265 1270 1275 1280 

Ser Leu Pro Glu Leu Pro Gly Glu Tyr Ser Met Lys Val Thr Gly Glu 

1285 1290 1295 

Gly Cys Val Tyr Leu Gin Thr Ser Leu Lys Tyr Asn He Leu Pro Glu 

1300 1305 _ _ 1310 

Lys Giu' Glu ; Phe Pro -;Phe" AXigLetf CGlylWal 

_/l ^1315" "" " 1320"" ~ 1325" ,"'" 

C^lAsp^Givr^iro' iLys >Ala~ 'His\%h^?Se ij \PMf Gln^l'e S.Sei^LVtr^Ser^ai 

" 1330 """1335 ""' " "134 0^ " 

S er ^T^r^Thr ^Gly ' ' S er-'Ar g ' " Se r *A1 "a^S e'r jpsn Me&fSaTli eV. Val^Asp.rvSL; 

345 1350 " '"_"" ^1355" 1"' J'™.",..'.'.,.^. 1360 

Lys""'t4¥t"Va^ 

'Glu'-Arg^Ser-'Asn '"His Val ' Se'r- ? Arg\.T^ Val 

1380 "_ / '_ " .1385"' "" ^ 1390... 

Leu"ile'Tyr "Leu Asp 'Lys Val -Ser'AsritGln TThr" ; L"eu ^Ser' Leu'' Phe" Phe 
1395 " 1400 " ~ ""i"405" 
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CAGCGGTGCG AGCTCCAGGC CCATGCACTG AGGAGGCGGA AACAAGGGGA GCCCCCAGAG 60 

CTCCATCAAG CCCCCTCCAA AGGCTCCCCT ACCCGGTCCA CGCCCCCCAC CCCCCCTCCC 120 

CGCCTCCTCC CAATTGTGCA TTTTTGCAGC CGGAGGCGGC TCCGAGATGG GGCTGTGAGC 18 0 

TTCGCCCGGG GAGGGGGAAA GAGCAGCGAG GAGTGAAGCG GGGGGGTGGG GTGAAGGGTT 240 

TGGATTTCGG GGCAGGGGGC GCACCCCCGT CAGCAGGCCC TCCCCAAGGG GCTCGGAACT 300 

CTACCTCTTC ACCCACGCCC CTGGTGCGCT TTGCCGAAGG AAAGAATAAG AACAGAGAAG 3 60 

GAGGAGGGGG AAAGGAGGAA AAGGGGGACC CCCCAACTGG GGGGGGTGAA GGAGAGAAGT 4 20 

AGCAGGACCA GAGGGGAAGG GGCTGCTGCT TGCATCAGCC CACACC ATG CTG ACC 475 

Met Leu Thr 
1 

CCG CCG TTG CTC CTG CTG CTG CCC CTG CTC TCA GCT CTG GTC GCG GCG 523 
Pro Pro Leu Leu Leu Leu Leu Pro Leu Leu Ser Ala Leu Val Ala Ala 
5 10 15 

GCT ATC GAC GCC CCT AAG ACT TGC AGC CCC AAG CAG TTT GCC TGC AGA 571 
Ala He Asp Ala Pro Lys Thr Cys Ser Pro Lys Gin Phe Ala Cys Arg 
20 25 30 35 

GAT CAA ATA ACC TGT ATC TCA AAG GGC TGG CGG TGC GAC GGT GAG AGG 619 
Asp Gin He Thr Cys He Ser Lys Gly Trp Arg Cys Asp Gly Glu Arg 
40 45 50 

GAC TGC CCA GAC GGA TCT GAC GAG GCC CCT GAG ATT TGT CCA CAG AGT 6 67 

Asp Cys Pro Asp Gly Ser Asp Glu Ala Pro Glu He Cys Pro Gin Ser 
55 60 65 

AAG GCC CAG CGA TGC CAG CCA AAC GAG CAT AAC TGC CTG GGT ACT GAG 715 
Lys Ala Gin Arg Cys Gin Pro Asn Glu His Asn Cys Leu Gly Thr Glu 
™ 75 80 

CTG TGT GTT CCC ATG TCC CGC CTC TGC AAT GGG GTC CAG GAC TGC ATG 7 63 

Leu Cys Val Pro Met Ser Arg Leu Cys Asn Gly Val Gin Asp Cys Met 
85 90 95 

GAC GGC TCA GAT GAG GGG CCC CAC TGC CGA GAG CTC CAA GGC AAC TGC 811 
Asp Gly Ser Asp Glu Gly Pro His Cys Arg Glu Leu Gin Gly Asn Cys 
100 105 no H5 

TCT CGC CTG GGC TGC CAG CAC CAT TGT GTC CCC ACA CTC GAT GGG CCC 859 ' 

Ser Arg Leu Gly Cys Gin His His Cys Val Pro Thr Leu Asp Gly Pro 
120 125 130 

ACC TGC TAC TGC AAC AGC AGC TTT CAG CTT CAG GCA GAT GGC AAG ACC 907 
Thr Cys Tyr Cys Asn Ser Ser Phe Gin Leu Gin Ala Asp Gly Lys Thr 
135 140 145 

TGC AAA GAT TTT GAT GAG TGC TCA GTG TAC GGC ACC TGC AGC CAG CTA 955 
Cys Lys Asp Phe Asp Glu Cys Ser Val Tyr Gly Thr Cys Ser Gin Leu 
150 155 160 

TGC ACC AAC ACA GAC GGC TCC TTC ATA TGT GGC TGT GTT GAA GGA TAC 1003 
Cys Thr Asn Thr Asp Gly Ser Phe He Cys Gly Cys Val Glu Gly Tyr 
165 170 175 

CTC CTG CAG CCG GAT AAC CGC TCC TGC AAG GCC AAG AAC GAG CCA GTA 1051 
Leu Leu Gin Pro Asp Asn Arg Ser Cys Lys Ala Lys Asn Glu Pro Val 
180 185 190 195 
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GAC CGG CCC CCT GTG CTG TTG ATA GCC AAC TCC CAG AAC ATC TTG GCC 

Asp Arg Pro Pro Val Leu Leu He Ala Asn Ser Gin Asn He Leu Ala 
200 205 210 

ACG TAC CTG AGT GGG GCC CAG GTG TCT ACC ATC ACA CCT ACG AGC ACG 

Thr Tyr Leu Ser Gly Ala Gin Val Ser Thr He Thr Pro Thr Ser Thr 
215 220 225 

CGG CAG ACC ACA GCC ATG GAC TTC AGC TAT GCC AAC GAG ACC GTA TGC 

Arg Gin Thr Thr Ala Met Asp Phe Ser Tyr Ala Asn Glu Thr Val Cys 
230 235 240 

TGG GTG CAT GTT GGG GAC AGT GCT GCT CAG ACG CAG CTC AAG TGT GCC 

Trp Val His Val Gly Asp Ser Ala Ala Gin Thr Gin Leu Lys Cys Ala 
245 250 255 

CGC ATG CCT GGC CTA AAG GGC TTC GTG GAT GAG CAC ACC ATC AAC ATC 

Arg Met Pro Gly Leu Lys Gly Phe Val Asp Glu His Thr He Asn He 

260 265 270 275 

TCC CTC AGT CTG CAC CAC GTG GAA CAG ATG GCC ATC GAC TGG CTG ACA 

Ser Leu Ser Leu His His Val Glu Gin Met Ala He Asp Trp Leu Thr 
280 285 290 

GGC AAC TTC TAC TTT GTG GAT GAC ATC GAT GAT AGG ATC TTT GTC TGC 

Gly Asn Phe Tyr Phe Val Asp Asp He Asp Asp Arg He Phe Val Cys 
295 300 305 

AAC AGA AAT GGG GAC ACA TGT GTC ACA TTG CTA GAC CTG GAA CTC TAC 

Asn Arg Asn Gly Asp Thr Cys Val Thr Leu Leu Asp Leu Glu Leu Tyr 
310 315 320 

AAC CCC AAG GGC ATT GCC CTG GAC CCT GCC ATG GGG AAG GTG TTT TTC 

Asn Pro Lys Gly He Ala Leu Asp Pro Ala Met Gly Lys Val Phe Phe 
325 330 335 

ACT GAC TAT GGG CAG ATC CCA AAG GTG GAA CGC TGT GAC ATG GAT GGG 

Thr Asp Tyr Gly Gin He Pro Lys Val Glu Arg Cys Asp Met Asp Gly 

340 345 350 355 

CAG AAC CGC ACC AAG CTC GTC GAC AGC AAG ATT GTG TTT CCT CAT GGC 

Gin Asn Arg Thr Lys Leu Val Asp Ser Lys He Val Phe Pro His Gly 
360 365 370 

ATC ACG CTG GAC CTG GTC AGC CGC CTT GTC TAC TGG GCA GAT GCC TAT 

He Thr- Leu Asp Leu Val Ser Arg Leu Val Tyr Trp Ala Asp Ala Tyr 
375 380 385 

CTG GAC TAT ATT GAA GTG GTG GAC TAT GAG GGC AAG GGC CGC CAG ACC 

Leu Asp Tyr He Glu Val Val Asp Tyr Glu Gly Lys Gly Arg Gin Thr 
390 395 400 

ATC ATC CAG GGC ATC CTG ATT GAG CAC CTG TAC GGC CTG ACT GTG TTT 

He He Gin Gly He Leu He Glu His Leu Tyr Gly Leu Jhr Val Phe 
405 410 415 

GAG AAT TAT CTC TAT GCC ACC AAC TCG GAC AAT GCC AAT GCC CAG CAG 

Glu Asn Tyr Leu Tyr Ala Thr Asn Ser Asp Asn Ala Asn Ala Gin Gin 

420 425 430 435 
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AAG ACG AGT GTG ATC CGT GTG AAC CGC TTT AAC AGC ACC GAG TAC CAG 1819 

Lys Thr Ser Val He Arg Val Asn Arg Phe Asn Ser Thr Glu Tyr Gin 
440 445 450 

GTT GTC ACC CGG GTG GAC AAG GGT GGT GCC CTC CAC ATC TAC CAC CAG 18 67 

Val Val Thr Arg Val Asp Lys Gly Gly Ala Leu His He Tyr His Gin 

455 460 465 

AGG CGT CAG CCC CGA GTG AGG AGC CAT GCC TGT GAA AAC GAC CAG TAT 1915 

Arg Arg Gin Pro Arg Val Arg Ser His Ala Cys Glu Asn Asp Gin Tyr 

470 475 480 

GGG AAG CCG GGT GGC TGC TCT GAC ATC TGC CTG CTG GCC AAC AGC CAC 19 63 

Gly Lys Pro Gly Gly Cys Ser Asp He Cys Leu Leu Ala Asn Ser His 
485 490 495 

AAG GCG CGG ACC TGC CGC TGC CGT TCC GGC TTC AGC CTG GGC AGT GAC 2011 

Lys Ala Arg Thr Cys Arg Cys Arg Ser Gly Phe Ser Leu Gly Ser Asp 
500 505 510 515 

GGG AAG TCA TGC AAG AAG CCG GAG CAT GAG CTG TTC CTC GTG TAT GGC 2059 

Gly Lys Ser Cys Lys Lys Pro Glu His Glu Leu Phe Leu Val Tyr Gly 
520 525 530 

AAG GGC CGG CCA GGC ATC ATC CGG GGC ATG GAT ATG GGG GCC AAG GTC 2107 

Lys Gly Arg Pro Gly He He Arg Gly Met Asp Met Gly Ala Lys Val 

535 540 545 

CCG GAT GAG CAC ATG ATC CCC ATT GAA AAC CTC ATG AAC CCC CGA GCC 2155 

Pro Asp Glu His Met He Pro He Glu Asn Leu Met Asn Pro Arg Ala 

550 555 560 

CTG GAC TTC CAC GCT GAG ACC GGC TTC ATC TAC TTT GCC GAC ACC ACC 2203 

Leu Asp Phe His Ala Glu Thr Gly Phe He Tyr Phe Ala Asp Thr Thr 
565 570 575 

AGC TAC CTC ATT GGC CGC CAG AAG ATT GAT GGC ACT GAG CGG GAG ACC 2 251 . 

Ser Tyr Leu He Gly Arg Gin Lys He Asp Gly Thr Glu Arg Glu Thr 
580 585 590 595 

ATC CTG AAG GAC GGC ATC CAC AAT GTG GAG GGT GTG GCC GTG GAC TGG 22 99 

He Leu Lys Asp Gly He His Asn Val Glu Gly Val Ala Val Asp Trp 
600 605 610 

ATG GGA GAC AAT CTG TAC TGG ACG GAC GAT GGG CCC AAA AAG ACA ATC 2347 

Met Gly Asp Asn Leu Tyr Trp Thr Asp Asp Gly Pro Lys Lys Thr He 

615 620 625 

AGC GTG GCC AGG CTG GAG AAA GCT GCT CAG ACC CGC AAG ACT TTA ATC 2395 

Ser Val Ala Arg Leu Glu Lys Ala Ala Gin Thr Arg Lys Thr Leu He 

630 635 640 

GAG GGC AAA ATG ACA CAC CCC AGG GCT ATT GTG GTG GAT CCA CTC AAT 24 43 

Glu Gly Lys Met Thr His Pro Arg Ala He Val Val Asp Pro Leu Asn 
645 650 655 

GGG TGG ATG TAC TGG ACA GAC TGG GAG GAG GAC CCC AAG GAC AGT CGG 24 91 

Gly Trp Met Tyr Trp Thr Asp Trp Glu Glu Asp Pro Lys Asp Ser Arg 
660 665 670 675 
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CGT GGG CGG CTG GAG AGG GCG TGG ATG GAT GGC TCA CAC CGA GAC ATC 2539 
Arg Gly Arg Leu Glu Arg Ala Trp Met Asp Gly Ser His Arg Asp lie 
680 685 690 

TTT GTC ACC TCC AAG ACA GTG CTT TGG CCC AAT GGG CTA AGC CTG GAC 2587 
Phe Val Thr Ser Lys Thr Val Leu Trp Pro Asn Gly Leu Ser Leu Asp 
695 700 705 

ATC CCG GCT GGG CGC CTC TAC TGG GTG GAT GCC TTC TAC GAC CGC ATC 2635 
He Pro Ala Gly Arg Leu Tyr Trp Val Asp Ala Phe Tyr Asp Arg He 
710 715 720 

GAG ACG ATA CTG CTC AAT GGC ACA GAC CGG AAG ATT GTG TAT GAA GGT 2 683 

Glu Thr He Leu Leu Asn Gly Thr Asp Arg Lys He Val Tyr Glu Gly 
725 730 735 

CCT GAG CTG AAC CAC GCC TTT GGC CTG TGT CAC CAT GGC AAC TAC CTC 2731 
Pro Glu Leu Asn His Ala Phe Gly Leu Cys His His Gly Asn Tyr Leu 
740 745 750 755 

TTC TGG ACT GAG TAT CGG AGT GGC AGT GTC TAC CGC TTG GAA CGG GGT 2779 
Phe Trp Thr Glu Tyr Arg Ser Gly Ser Val Tyr Arg Leu Glu Arg Gly 
760 765 770 

GTA GGA GGC GCA CCC CCC ACT GTG ACC CTT CTG CGC AGT GAG CGG CCC 2 827 

Val Gly Gly Ala Pro Pro Thr Val Thr Leu Leu Arg Ser Glu Arg Pro 
775 780 785 

CCC ATC TTT GAG ATC CGA ATG TAT GAT GCC CAG CAG CAG CAA GTT GGC 2875 
Pro He Phe Glu He Arg Met Tyr Asp Ala Gin Gin Gin Gin Val Gly 
790 795 800 

ACC AAC AAA TGC CGG GTG AAC AAT GGC GGC TGC AGC AGC CTG TGC TTG 2 923 

Thr Asn Lys Cys Arg Val Asn Asn Gly Gly Cys Ser Ser Leu Cys Leu 
805 810 815 

GCC ACC CCT GGG AGC CGC CAG TGC GCC TGT GCT GAG GAC CAG GTG TTG 2971 
Ala Thr Pro Gly Ser Arg Gin Cys Ala Cys Ala Glu Asp Gin Val Leu 
820 825 830 835 

GAC GCA GAC GGC GTC ACT TGC TTG GCG AAC CCA TCC TAC GTG CCT CCA 3019 
Asp Ala Asp Gly Val Thr Cys Leu Ala Asn Pro Ser Tyr Val Pro Pro 
840 845 850 

CCC CAG TGC CAG CCA GGC GAG TTT GCC TGT GCC AAC AGC CGC TGC ATC 3067 
Pro Gin Cys Gin Pro Gly Glu Phe Ala Cys Ala Asn Ser Arg Cys He 
855 860 865 

CAG GAG CGC TGG AAG TGT GAC GGA GAC AAC GAT TGC CTG GAC AAC AGT 3115 
Gin Glu Arg Trp Lys Cys Asp Gly Asp Asn Asp Cys Leu Asp Asn Ser 
870 875 880 

GAT GAG GCC CCA GCC CTC TGC CAT CAG CAC ACC TGC CCC TCG GAC CGA 3163 
Asp Glu Ala Pro Ala Leu Cys His Gin His Thr Cys Pro Ser Asp Arg 
885 890 895 

TTC AAG TGC GAG AAC AAC CGG TGC ATC CCC AAC CGC TGG CTC TGC GAC 3211 
Phe Lys Cys Glu Asn Asn Arg Cys He Pro Asn Arg Trp Leu Cys Asp 
900 905 910 915 
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GGG GAC AAT GAC TGT GGG AAC AGT GAA GAT GAG TCC AAT GCC ACT TGT 
Gly Asp Asn Asp Cys Gly Asn Ser Glu Asp Glu Ser Asn Ala Thr Cys 
920 925 930 

TCA GCC CGC ACC TGC CCC CCC AAC CAG TTC TCC TGT GCC AGT GGC CGC 
Ser Ala Arg Thr Cys Pro Pro Asn Gin Phe Ser Cys Ala Ser Gly Arg 
935 940 945 

TGC ATC CCC ATC TCC TGG ACG TGT GAT CTG GAT GAC GAC TGT GGG GAC 
Cys lie Pro lie Ser Trp Thr Cys Asp Leu Asp Asp Asp Cys Gly Asp 
950 955 960 

CGC TCT GAT GAG TCT GCT TCG TGT GCC TAT CCC ACC TGC TTC CCC CTG 
Arg Ser Asp Glu Ser Ala Ser Cys Ala Tyr Pro Thr Cys Phe Pro Leu 
965 970 975 

ACT CAG TTT ACC TGC AAC AAT GGC AGA TGT ATC AAC ATC AAC TGG AGA 
Thr Gin Phe Thr Cys Asn Asn Gly Arg Cys lie Asn lie Asn Trp Arg 
980 985 990 995 

TGC GAC AAT GAC AAT GAC TGT GGG GAC AAC AGT GAC GAA GCC GGC TGC 
Cys Asp Asn Asp Asn Asp Cys Gly Asp Asn Ser Asp Glu Ala Gly Cys 
1000 1005 1010 

AGC CAC TCC TGT TCT AGC ACC CAG TTC AAG TGC AAC AGC GGG CGT TGC 
Ser His Ser Cys Ser Ser Thr Gin Phe Lys Cys Asn Ser Gly Arg Cys 
1015 1020 1025 

ATC CCC GAG CAC TGG ACC TGC GAT GGG GAC AAT GAC TGC GGA GAC TAC 
lie Pro Glu His Trp Thr Cys Asp Gly Asp Asn Asp Cys Gly Asp Tyr 
1030 1035 1040 

AGT GAT GAG ACA CAC GCC AAC TGC ACC AAC CAG GCC ACG AGG CCC CCT 
Ser Asp Glu Thr His Ala Asn Cys Thr Asn Gin Ala Thr Arg Pro Pro 
1045 1050 1055 

GGT GGC TGC CAC ACT GAT GAG TTC CAG TGC CGG CTG GAT GGA CTA TGC 
Gly Gly Cys His Thr Asp Glu Phe Gin Cys Arg Leu Asp Gly Leu Cys 
1060 1065 1070 1075 

ATC CCC CTG CGG TGG CGC TGC GAT GGG GAC ACT GAC TGC ATG GAC TCC 
lie Pro Leu Arg Trp Arg Cys Asp Gly Asp Thr Asp Cys Met Asp Ser 
1080 1085 1090 

AGC GAT GAG AAG AGC TGT GAG GGA GTG ACC CAC GTC TGC GAT CCC AGT 
Ser Asp Glu Lys Ser Cys Glu Gly Val Thr His Val Cys Asp Pro Ser 
1095 1100 1105 

GTC AAG TTT GGC TGC AAG GAC TCA GCT CGG TGC ATC AGC AAA GCG TGG 
Val Lys Phe Gly Cys Lys Asp Ser Ala Arg Cys lie Ser Lys Ala Trp 
1110 1115 1120 

GTG TGT GAT GGC GAC AAT GAC TGT GAG GAT AAC TCG GAC GAG GAG AAC 
Val Cys Asp Gly Asp Asn Asp Cys Glu Asp Asn Ser Asp Glu Glu Asn 
1125 1130 1135 

TGC GAG TCC CTG GCC TGC AGG CCA CCC TCG CAC CCT TGT GCC AAC AAC 
Cys Glu Ser Leu Ala Cys Arg Pro Pro Ser His Pro Cys Ala Asn Asn 
1140 1145 1150 1155 
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ACC TCA GTC TGC CTG CCC CCT GAC AAG CTG TGT GAT GGC AAC GAC GAC 3 97 9 
Thr Ser Val Cys Leu Pro Pro Asp Lys Leu Cys Asp Gly Asn Asp Asp 
1160 1165 1170 

TGT GGC GAC GGC TCA GAT GAG GGC GAG CTC TGC GAC CAG TGC TCT CTG 4 027 

Cys Gly Asp Gly Ser Asp Glu Gly Glu Leu Cys Asp Gin Cys Ser Leu 
1175 1180 1185 

AAT AAC GGT GGC TGC AGC CAC AAC TGC TCA GTG GCA CCT GGC GAA GGC 4 07 5 
Asn Asn Gly Gly Cys Ser His Asn Cys Ser Val Ala Pro Gly Glu Gly 
1190 1195 1200 

ATT GTG TGT TCC TGC CCT CTG GGC ATG GAG CTG GGG CCC GAC AAC CAC 4123 
lie Val Cys Ser Cys Pro Leu Gly Met Glu Leu Gly Pro Asp Asn His 
1205 1210 1215 

ACC TGC CAG ATC CAG AGC TAC TGT GCC AAG CAT CTC AAA TGC AGC CAA 4171 
Thr Cys Gin lie Gin Ser Tyr Cys Ala Lys His Leu Lys Cys Ser Gin 
1220 1225 1230 1235 

AAG TGC GAC CAG AAC AAG TTC AGC GTG AAG TGC TCC TGC TAC GAG GGC 4 219 
Lys Cys Asp Gin Asn Lys Phe Ser Val Lys Cys Ser Cys Tyr Glu Gly 
1240 1245 1250 

TGG GTC CTG GAA CCT GAC GGC GAG AGC TGC CGC AGC CTG GAC CCC TTC 4 2 67 
Trp Val Leu Glu Pro Asp Gly Glu Ser Cys Arg Ser Leu Asp Pro Phe 
1255 1260 1265 

AAG CCG TTC ATC ATT TTC TCC AAC CGC CAT GAA ATC CGG CGC ATC GAT 4 315 
Lys Pro Phe lie lie Phe Ser Asn Arg His Glu lie Arg Arg lie Asp 
1270 1275 1280 

CTT CAC AAA GGA GAC TAC AGC GTC CTG GTG CCC GGC CTG CGC AAC ACC 4 3 63 
Leu His Lys Gly Asp Tyr Ser Val Leu Val Pro Gly Leu Arg Asn Thr 
1285 1290 1295 

ATC GCC CTG GAC TTC CAC CTC AGC CAG AGC GCC CTC TAC TGG ACC GAC 4 411 
lie Ala Leu Asp Phe His Leu Ser Gin Ser Ala Leu Tyr Trp Thr Asp 
1300 1305 1310 1315 

GTG GTG GAG GAC AAG ATC TAC CGC GGG AAG CTG CTG GAC AAC GGA GCC 4 4 59 

Val Val Glu Asp Lys lie Tyr Arg Gly Lys Leu Leu Asp Asn Gly Ala 
1320 1325 1330 

CTG ACT AGT TTC GAG GTG GTG ATT CAG TAT GGC CTG GCC ACA CCC GAG 4 507 

Leu Thr Ser Phe Glu Val Val lie Gin Tyr Gly Leu Ala Thr Pro Glu 
1335 1340 1345 

GGC CTG GCT GTA GAC TGG ATT GCA GGC AAC ATC TAC TGG GTG GAG AGT 4 555 
Gly Leu Ala Val Asp Trp lie Ala Gly Asn He Tyr Trp Val Glu Ser 
1350 1355 1360 

AAC CTG GAT CAG ATC GAG GTG GCC AAG CTG GAT GGG ACC CTC CGG ACC 4 603 
Asn Leu Asp Gin He Glu Val Ala Lys Leu Asp Gly Thr Leu Arg Thr 
1365 1370 1375 

ACC CTG CTG GCC GGT GAC ATT GAG CAC CCA AGG GCA ATC GCA CTG GAT 4 651 

Thr Leu Leu Ala Gly Asp He Glu His Pro Arg Ala He Ala Leu Asp 
1380 1385 1390 1395 
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CCC CGG GAT GGG ATC CTG TTT TGG ACA GAC TGG GAT GCC AGC CTG CCC 4 699 
Pro Arg Asp Gly lie Leu Phe Trp Thr Asp Trp Asp Ala Ser Leu Pro 
1400 1405 1410 

CGC ATT GAG GCA GCC TCC ATG AGT GGG GCT GGG CGC CGC ACC GTG CAC 47 4 7 
Arg lie Glu Ala Ala Ser Met Ser Gly Ala Gly Arg Arg Thr Val His 
1415 1420 1425 

CGG GAG ACC GGC TCT GGG GGC TGG CCC AAC GGG CTC ACC GTG GAC TAC 4795 
Arg Glu Thr Gly Ser Gly Gly Trp Pro Asn Gly Leu Thr Val Asp Tyr 
1430 1435 1440 

CTG GAG AAG CGC ATC CTT TGG ATT GAC GCC AGG TCA GAT GCC ATT TAC 4 843 
Leu Glu Lys Arg lie Leu Trp lie Asp Ala Arg Ser Asp Ala lie Tyr 
1445 1450 1455 

TCA GCC CGT TAC GAC GGC TCT GGC CAC ATG GAG GTG CTT CGG GGA CAC 4 891 
Ser Ala Arg Tyr Asp Gly Ser Gly His Met Glu Val Leu Arg Gly His 
1460 ' 1465 1470 1475 

GAG TTC CTG TCG CAC CCG TTT GCA GTG ACG CTG TAC GGG GGG GAG GTC 4 939 
Glu Phe Leu Ser His Pro Phe Ala Val Thr Leu Tyr Gly Gly Glu Val 
1480 1485 1490 

TAC TGG ACT GAC TGG CGA AC A AAC ACA CTG GCT AAG GCC AAC AAG TGG 4 987 
Tyr Trp Thr Asp Trp Arg Thr Asn Thr Leu Ala Lys Ala Asn Lys Trp 
1495 1500 ~ 1505 

ACC GGC CAC AAT GTC ACC GTG GTA CAG AGG ACC AAC ACC CAG CCC TTT 5035 
Thr Gly His Asn Val Thr Val Val Gin Arg Thr Asn Thr Gin Pro Phe 
1510 1515 ' 1520 

GAC CTG CAG GTG TAC CAC CCC TCC CGC CAG CCC ATG GCT CCC AAT CCC 5083 
Asp Leu Gin Val Tyr His Pro Ser Arg Gin Pro Met Ala Pro Asn Pro 
1525 1530 1535 

TGT GAG GCC AAT GGG GGC CAG GGC CCC TGC TCC CAC CTG TGT CTC ATC 5131 
Cys Glu Ala Asn Gly Gly Gin Gly Pro Cys Ser His Leu Cys Leu lie 
1540 1545 1550 1555 

AAC TAC AAC CGG ACC GTG TCC TGC GCC TGC CCC CAC CTC ATG AAG CTC 517 9 
Asn Tyr Asn Arg Thr Val Ser Cys Ala Cys Pro His Leu Met Lys Leu 
1560 1565 1570 

CAC AAG GAC AAC ACC ACC TGC TAT GAG TTT AAG AAG TTC CTG CTG TAC 5227 
His Lys Asp Asn Thr Thr Cys Tyr Glu Phe Lys Lys Phe Leu Leu Tyr 
1575 1580 1585 

GCA CGT CAG ATG GAG ATC CGA GGT GTG GAC CTG GAT GCT CCC TAC TAC 5275 
Ala Arg Gin Met Glu lie Arg Gly Val Asp Leu Asp Ala Pro Tyr Tyr 
1590 1595 1600 

AAC TAC ATC ATC TCC TTC ACG GTG CCC GAC ATC GAC AAC GTC ACA GTG 5323 
Asn Tyr lie lie Ser Phe Thr Val Pro Asp lie Asp Asn Val Thr Val 
1605 1610 1615 

CTA GAC TAC GAT GCC CGC GAG CAG CGT GTG TAC TGG TCT GAC GTG CGG 5371 
Leu Asp Tyr Asp Ala Arg Glu Gin Arg Val Tyr Trp Ser Asp Val Arg 
1620 1625 1630 1635 
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ACA CAG GCC ATC AAG CGG GCC TTC ATC AAC GGC ACA GGC GTG GAG ACA 
Thr Gin Ala lie Lys Arg Ala Phe lie Asn Gly Thr Gly Val Glu Thr 
1640 1645 1650 

GTC GTC TCT GCA GAC TTG CCA AAT GCC CAC GGG CTG GCT GTG GAC TGG 
Val Val Ser Ala Asp Leu Pro Asn Ala His Gly Leu Ala Val Asp Trp 
1655 1660 " 1665 

GTC TCC CGA AAC CTG TTC TGG ACA AGC TAT GAC ACC AAT AAG AAG CAG 
Val Ser Arg Asn Leu Phe Trp Thr Ser Tyr Asp Thr Asn Lys Lys Gin 
1670 1675 * 1680 

ATC AAT GTG GCC CGG CTG GAT GGC TCC TTC AAG AAC GCA GTG GTG CAG 
lie Asn Val Ala Arg Leu Asp Gly Ser Phe Lys Asn Ala Val Val Gin 
1685 1690 1695 

GGC CTG GAG CAG CCC CAT GGC CTT GTC GTC CAC CCT CTG CGT GGG AAG 
Gly Leu Glu Gin Pro His Gly Leu Val Val His Pro Leu Arg Gly Lys 
1700 1705 1710 1715 

CTC TAC TGG ACC GAT GGT GAC AAC ATC AGC ATG GCC AAC ATG GAT GGC 
Leu Tyr Trp Thr Asp Gly Asp Asn lie Ser Met Ala Asn Met Asp Gly 
1720 1725 1730 

AGC AAT CGC ACC CTG CTC TTC AGT GGC CAG AAG GGC CCC GTG GGC CTG 
Ser Asn Arg Thr Leu Leu Phe Ser Gly Gin Lys Gly Pro Val Gly Leu 
1735 1740 ' ' 1745 

GCT ATT GAC TTC CCT GAA AGC AAA CTC TAC TGG ATC AGC TCC GGG AAC 
Ala lie Asp Phe Pro Glu Ser Lys Leu Tyr Trp lie Ser Ser Gly Asn 
1750 1755 1760 

CAT ACC ATC AAC CGC TGC AAC CTG GAT GGG AGT GGG CTG GAG GTC ATC 
His Thr lie Asn Arg Cys Asn Leu Asp Gly Ser Gly Leu Glu Val lie 
1765 1770 " 1775 

GAT GCC ATG CGG AGC CAG CTG GGC AAG GCC ACC GCC CTG GCC ATC ATG 
Asp Ala Met Arg Ser Gin Leu Gly Lys Ala Thr Ala Leu Ala He Met 
1780 1785 1790 1795 

GGG GAC AAG CTG TGG TGG GCT GAT CAG GTG TCG GAA AAG ATG GGC ACA 
Gly Asp Lys Leu Trp Trp Ala Asp Gin Val Ser Glu Lys Met Gly Thr 
1800 1805 1810 

TGC AGC AAG GCT GAC GGC TCG GGC TCC GTG GTC CTT CGG AAC AGC ACC 
Cys Ser Lys Ala Asp Gly Ser Gly Ser Val Val Leu Arg Asn Ser Thr 
1815 1820 1825 

ACC CTG GTG ATG CAC ATG AAG GTC TAT GAC GAG AGC ATC CAG CTG GAC 
Thr Leu Val Met His Met Lys Val Tyr Asp Glu Ser He Gin Leu Asp 
1830 1835 1840 

CAT AAG GGC ACC AAC CCC TGC AGT GTC AAC AAC GGT GAC TGC TCC CAG 
His Lys Gly Thr Asn Pro Cys Ser Val Asn Asn Gly Asp Cys Ser Gin 
1845 1850 1855 

CTC TGC CTG CCC ACG TCA GAG ACG ACC CGC TCC TGC ATG TGC ACA GCC 
Leu Cys Leu Pro Thr Ser Glu Thr Thr Arg Ser Cys Met Cys Thr Ala 
1860 1865 1870 1875 



FIG. 8a 



8449-128 



(SHEETS OF Xi 



GGC TAT AGC CTC CGG AGT GGC CAG CAG GCC TGC GAG GGC GTA GGT TCC 6139 
Gly Tyr Ser Leu Arg Ser Gly Gin Gin Ala Cys Glu Gly Val Gly Ser 
1880 1885 1890 

TTT CTC CTG TAC TCT GTG CAT GAG GGA ATC AGG GGA ATT CCC CTG GAT 6187 
Phe Leu Leu Tyr Ser Val His Glu Gly He Arg Gly lie Pro Leu Asp 
1895 1900 1905 

CCC AAT GAC AAG TCA GAT GCC CTG GTC CCA GTG TCC GGG ACC TCG CTG 6235 
Pro Asn Asp Lys Ser Asp Ala Leu Val Pro Val Ser Gly Thr Ser Leu 
1910 1915 1920 

GCT GTC GGC ATC GAC TTC CAC GCT GAA AAT GAC ACC ATC TAC TGG GTG 6283 
Ala Val Gly He Asp Phe His Ala Glu Asn Asp Thr lie Tyr Trp Val 
1925 1930 1935 

GAC ATG GGC CTG AGC ACG ATC AGC CGG GCC AAG CGG GAC CAG ACG TGG 6331 
Asp Met Gly Leu Ser Thr He Ser Arg Ala Lys Arg Asp Gin Thr Trp 
1940 1945 ' 1950 " " 1955 

CGT GAA GAC GTG GTG ACC AAT GGC ATT GGC CGT GTG GAG GGC ATT GCA 637 9 
Arg Glu Asp Val Val Thr Asn Gly He Gly Arg Val Glu Gly He Ala 
1960 1965 1970 

GTG GAC TGG ATC GCA GGC AAC ATC TAC TGG AC A GAC CAG GGC TTT GAT 64 2 7 

Val Asp Trp He Ala Gly Asn He Tyr Trp Thr Asp Gin Gly Phe Asp 
1975 1980 " 1985 

GTC ATC GAG GTC GCC CGG CTC AAT GGC TCC TTC CGC TAC GTG GTG ATC 64 7 5 

Val He Glu Val Ala Arg Leu Asn Gly Ser Phe Arg Tyr Val Val He 
1990 1995 2000 

TCC CAG GGT CTA GAC AAG CCC CGG GCC ATC ACC GTC CAC CCG GAG AAA 6523 
Ser Gin Gly Leu Asp Lys Pro Arg Ala He Thr Val His Pro Glu Lys 
2005 2010 2015 

GGG TAC TTG TTC TGG ACT GAG TGG GGT CAG TAT CCG CGT ATT GAG CGG 6571 
Gly Tyr Leu Phe Trp Thr Glu Trp Gly Gin Tyr Pro Arg He Glu Arg 
2020 2025 2030 2035 

TCT CGG CTA GAT GGC ACG GAG CGT GTG GTG CTG GTC AAC GTC AGC ATC 6619 
Ser Arg Leu Asp Gly Thr Glu Arg Val Val Leu Val Asn Val Ser He 
2040 2045 2050 

AGC TGG CCC AAC GGC ATC TCA GTG GAC TAC CAG GAT GGG AAG CTG TAC 6667 
Ser Trp Pro Asn Gly He Ser Val Asp Tyr Gin Asp Gly Lys Leu Tyr 
2055 2060 2065 

TGG TGC GAT GCA CGG ACA GAC AAG ATT GAA CGG ATC GAC CTG GAG ACA 6715 
Trp Cys Asp Ala Arg Thr Asp Lys He Glu Arg He Asp Leu Glu Thr 
2070 * 2075 " 2080 

GGT GAG AAC CGC GAG GTG GTT CTG TCC AGC AAC AAC ATG GAC ATG TTT 67 63 

Gly Glu Asn Arg Glu Val Val Leu Ser Ser Asn Asn Met Asp Met Phe 
2085 2090 2095 

TCA GTG TCT GTG TTT GAG GAT TTC ATC TAC TGG AGT GAC AGG ACT CAT 6811 
Ser Val Ser Val Phe Glu Asp Phe He Tyr Trp Ser Asp Arg Thr His 
2100 2105 2110 2115 
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GCC AAC GGC TCT ATC AAG CGC GGG AGC AAA 
Ala Asn Gly Ser lie Lys Arg Gly Ser Lys 
2120 2125 

GTG CCC CTG CGA ACC GGC ATC GGC GTC CAG 
Val Pro Leu Arg Thr Gly He Gly Val Gin 
2135 2140 

TTC AAC CGG GAC CGG CAG AAA GGC ACC AAC 
Phe Asn Arg Asp Arg Gin Lys Gly Thr Asn 
2150 2155 

GGC GGG TGC CAG CAG CTG TGC CTG TAC CGG 
Gly Gly Cys Gin Gin Leu Cys Leu Tyr Arg 
2165 2170 

TGC GCC TGT GCC CAC GGG ATG CTG GCT GAA 
Cys Ala Cys Ala His Gly Met Leu Ala Glu 
2180 2185 2190 2195 

GAG TAT GCC GGC TAC CTG CTC TAC TCA GAG CGC ACC ATT CTC AAG AGT 7099 
Glu Tyr Ala Gly Tyr Leu Leu Tyr Ser Glu Arg Thr He Leu Lys Ser 
2200 2205 2210 

ATC CAC CTG TCG GAT GAG CGC AAC CTC AAT GCG CCC GTG CAG CCC TTC 7147 
He His Leu Ser Asp Glu Arg Asn Leu Asn Ala Pro Val Gin Pro Phe 
2215 2220 2225 

GAG GAC CCT GAG CAC ATG AAG AAC GTC ATC GCC CTG GCC TTT GAC TAC 7195 
Glu Asp Pro Glu His Met Lys Asn Val He Ala Leu Ala Phe Asp Tyr 
2230 2235 2240 

CGG GCA GGC ACC TCT CCG GGC ACC CCC AAT CGC ATC TTC TTC AGC GAC 724 3 

Arg Ala Gly Thr Ser Pro Gly Thr Pro Asn Arg He Phe Phe Ser Asp 
2245 2250 2255 

ATC CAC TTT GGG AAC ATC CAA CAG ATC AAC GAC GAT GGC TCC AGG AGG 72 91 

He His Phe Gly Asn He Gin Gin He Asn Asp Asp Gly Ser Arg Arg 
2260 2265 2270 2275 

ATC ACC ATT GTG GAA AAC GTG GGC TCC GTG GAA GGC CTG GCC TAT CAC 7339 
He Thr He Val Glu Asn Val Gly Ser Val Glu Gly Leu Ala Tyr His 
2280 2285 2290 

CGT GGC TGG GAC ACT CTC TAT TGG ACA AGC TAC ACG ACA TCC ACC ATC 7 387 
Arg Gly Trp Asp Thr Leu Tyr Trp Thr Ser Tyr Thr Thr Ser Thr He 
2295 2300 2305 

ACG CGC CAC ACA GTG GAC CAG ACC CGC CCA GGG GCC TTC GAG CGT GAG 7 4 35 

Thr Arg His Thr Val Asp Gin Thr Arg Pro Gly Ala Phe Glu Arg Glu 
2310 2315 2320 

ACC GTC ATC ACT ATG TCT GGA GAT GAC CAC CCA CGG GCC TTC GTT TTG 74 83 
Thr Val He Thr Met Ser Gly Asp Asp His Pro Arg Ala Phe Val Leu 
2325 2330 2335 

GAC GAG TGC CAG AAC CTC ATG TTC TGG ACC AAC TGG AAT GAG CAG CAT 7 531 
Asp Glu Cys Gin Asn Leu Met Phe Trp Thr Asn Trp Asn Glu Gin His 
2340 2345 2350 2355 



GAC AAT GCC ACA GAC TCC 68 59 

Asp Asn Ala Thr Asp Ser 
2130 

CTT AAA GAC ATC AAA GTC 6907 

Leu Lys Asp He Lys Val 
2145 

GTG TGC GCG GTG GCC AAT 6955 

Val Cys Ala Val Ala Asn 
2160 

GGC CGT GGG CAG CGG GCC 7003 

Gly Arg Gly Gin Arg Ala 
2175 

GAC GGA GCA TCG TGC CGC 7051 

Asp Gly Ala Ser Cys Arg 
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CCC AGC ATC ATG CGG GCG GCG CTC TCG GGA GCC AAT GTC CTG ACC CTT 
Pro Ser He Met Arg Ala Ala Leu Ser Gly Ala Asn Val Leu Thr Leu 
236 0 2365 2370 

ATC GAG AAG GAC ATC CGT ACC CCC AAT GGC CTG GCC ATC GAC CAC CGT 
He Glu Lys Asp He Arg Thr Pro Asn Gly Leu Ala He Asp His Arq 
2375 2380 2385 

GCC GAG AAG CTC TAC TTC TCT GAC GCC ACC CTG GAC AAG ATC GAG CGG 
Ala Glu Lys Leu Tyr Phe Ser Asp Ala Thr Leu Asp Lys He Glu Arg 
2390 2395 2400 

TGC GAG TAT GAC GGC TCC CAC CGC TAT GTG ATC CTA AAG TCA GAG CCT 
Cys Glu Tyr Asp Gly Ser His Arg Tyr Val He Leu Lys Ser Glu Pro 
2405 2410 2415 

GTC CAC CCC TTC GGG CTG GCC GTG TAT GGG GAG CAC ATT TTC TGG ACT 
Val His Pro Phe Gly Leu Ala Val Tyr Gly Glu His He Phe Trp Thr 
2420 2425 2430 2435 

GAC TGG GTG CGG CGG GCA GTG CAG CGG GCC AAC AAG CAC GTG GGC AGC 
Asp Trp Val Arg Arg Ala Val Gin Arg Ala Asn Lys His Val Gly Ser 
2440 2445 2450 

AAC ATG AAG CTG CTG CGC GTG GAC ATC CCC CAG CAG CCC ATG GGC ATC 
Asn Met Lys Leu Leu Arg Val Asp He Pro Gin Gin Pro Met Gly He 
2455 2460 2465 

ATC GCC GTG GCC AAC GAC ACC AAC AGC TGT GAA CTC TCT CCA TGC CGA 
He Ala Val Ala Asn Asp Thr Asn Ser Cys Glu Leu Ser Pro Cys Arg 
2470 2475 2480 

ATC AAC AAC GGT GGC TGC CAG GAC CTG TGT CTG CTC ACT CAC CAG GGC 
He Asn Asn Gly Gly Cys Gin Asp Leu Cys Leu Leu Thr His Gin Gly 
2485 2490 2495 

CAT GTC AAC TGC TCA TGC CGA GGG GGC CGA ATC CTC CAG GAT GAC CTC 
His Val Asn Cys Ser Cys Arg Gly Gly Arg He Leu Gin Asp Asp Leu 
2500 2505 2510 2515 

ACC TGC CGA GCG GTG AAT TCC TCT TGC CGA GCA CAA GAT GAG TTT GAG 
Thr Cys Arg Ala Val Asn Ser Ser Cys Arg Ala Gin Asp Glu Phe Glu 
2520 2525 2530 

TGT GCC AAT GGC GAG TGC ATC AAC TTC AGC CTG ACC TGC GAC GGC GTC 
Cys Ala Asn Gly Glu Cys He Asn Phe Ser Leu Thr Cys Asp Gly Val 
2535 2540 2545 

CCC CAC TGC AAG GAC AAG TCC GAT GAG AAG CCA TCC TAC TGC AAC TCC 
Pro His Cys Lys Asp Lys Ser Asp Glu Lys Pro Ser Tyr Cys Asn Ser 
2550 2555 2560 

CGC CGC TGC AAG AAG ACT TTC CGG CAG TGC AGC AAT GGG CGC TGT GTG 
Arg Arg Cys Lys Lys Thr Phe Arg Gin Cys Ser Asn Gly Arg Cys Val 
2565 2570 2575 

TCC AAC ATG CTG TGG TGC AAC GGG GCC GAC GAC TGT GGG GAT GGC TCT 
Ser Asn Met Leu Trp Cys Asn Gly Ala Asp Asp Cys Gly Asp Gly Ser 
2580 2585 2590 2595 
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GAC GAG ATC CCT TGC AAC AAG ACA GCC TGT GGT GTG GGC GAG TTC CGC 8299 
Asp Glu lie Pro Cys Asn Lys Thr Ala Cys Gly Val Gly Glu Phe Arg 
2600 2605 2610 

TGC CGG GAC GGG ACC TGC ATC GGG AAC TCC AGC CGC TGC AAC CAG TTT 8347 
Cys Arg Asp Gly Thr Cys He Gly Asn Ser Ser Arg Cys Asn Gin Phe 
2615 2620 2625 

GTG GAT TGT GAG GAC GCC TCA GAT GAG ATG AAC TGC AGT GCC ACC GAC 8395 
Val Asp Cys Glu Asp Ala Ser Asp Glu Met Asn Cys Ser Ala Thr Asp 
2630 2635 2640 

TGC AGC AGC TAC TTC CGC CTG GGC GTG AAG GGC GTG CTC TTC CAG CCC 84 43 

Cys Ser Ser Tyr Phe Arg Leu Gly Val Lys Gly Val Leu Phe Gin Pro 
2645 " 2650 * 2655 

TGC GAG CGG ACC TCA CTC TGC TAC GCA CCC AGC TGG GTG TGT GAT GGC 8491 
Cys Glu Arg Thr Ser Leu Cys Tyr Ala Pro Ser Trp Val Cys Asp Gly 
2660 " 2665 2670 ' 2675 

GCC AAT GAC TGT GGG GAC TAC AGT GAT GAG CGC GAC TGC CCA GGT GTG 8539 
Ala Asn Asp Cys Gly Asp Tyr Ser Asp Glu Arg Asp Cys Pro Gly Val 
2680 2685 ' 2690 

AAA CGC CCC AGA TGC CCT CTG AAT TAC TTC GCC TGC CCT AGT GGG CGC 8 587 
Lys Arg Pro Arg Cys Pro Leu Asn Tyr Phe Ala Cys Pro Ser Gly Arg 
2695 2700 2705 

TGC ATC CCC ATG AGC TGG ACG TGT GAC AAA GAG GAT GAC TGT GAA CAT 8 635 
Cys He Pro Met Ser Trp Thr Cys Asp Lys Glu Asp Asp Cys Glu His 
2710 2715 2720 

GGC GAG GAC GAG ACC CAC TGC AAC AAG TTC TGC TCA GAG GCC CAG TTT 8 68 3 
Gly Glu Asp Glu Thr His Cys Asn Lys Phe Cys Ser Glu Ala Gin Phe 
2725 ~ 2730 2735 

GAG TGC CAG AAC CAT CGC TGC ATC TCC AAG CAG TGG CTG TGT GAC GGC 8731 
Glu Cys Gin Asn His Arg Cys He Ser Lys Gin Trp Leu Cys Asp Gly 
2740 2745 2750 2755 

AGC GAT GAC TGT GGG GAT GGC TCA GAC GAG GCT GCT CAC TGT GAA GGC 877 9 
Ser Asp Asp Cys Gly Asp Gly Ser Asp Glu Ala Ala His Cys Glu Gly 
2760 2765 2770 

AAG ACG TGC GGC CCC TCC TCC TTC TCC TGC CCT GGC ACC CAC GTG TGC 8827 
Lys Thr Cys Gly Pro Ser Ser Phe Ser Cys Pro Gly Thr His Val Cys 
2775 2780 2785 

GTC CCC GAG CGC TGG CTC TGT GAC GGT GAC AAA GAC TGT GCT GAT GGT 8875 
Val Pro Glu Arg Trp Leu Cys Asp Gly Asp Lys Asp Cys Ala Asp Gly 
2790 2795 2800 

GCA GAC GAG AGC ATC GCA GCT GGT TGC TTG TAC AAC AGC ACT TGT GAC 8923 
Ala Asp Glu Ser He Ala Ala Gly Cys Leu Tyr Asn Ser Thr Cys Asp 
2805 2810 2815 

GAC CGT GAG TTC ATG TGC CAG AAC CGC CAG TGC ATC CCC AAG CAC TTC 8971 
Asp Arg Glu Phe Met Cys Gin Asn Arg Gin Cys He Pro Lys His Phe 
2820 2825 2830 2835 
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GTG TGT GAC CAC GAC CGT GAC TGT GCA GAT GGC TCT GAT GAG TCC CCC 
Val Cys Asp His Asp Arg Asp Cys Ala Asp Gly Ser Asp Glu Ser Pro 
2840 2845 2850 

GAG TGT GAG TAC CCG ACC TGC GGC CCC AGT GAG TTC CGC TGT GCC AAT 
Glu Cys Glu Tyr Pro Thr Cys Gly Pro Ser Glu Phe Arg Cys Ala Asn 
2855 2860 2865 

GGG CGC TGT CTG AGC TCC CGC CAG TGG GAG TGT GAT GGC GAG AAT GAC 
Gly Arg Cys Leu Ser Ser Arg Gin Trp Glu Cys Asp Gly Glu Asn Asp 
2870 2875 2880 

TGC CAC GAC CAG AGT GAC GAG GCT CCC AAG AAC CCA CAC TGC ACC AGC 
Cys His Asp Gin Ser Asp Glu Ala Pro Lys Asn Pro His Cys Thr Ser 
2885 2890 2895 

CCA GAG CAC AAG TGC AAT GCC TCG TCA CAG TTC CTG TGC AGC AGT GGG 
Pro Glu His Lys Cys Asn Ala Ser Ser Gin Phe Leu Cys Ser Ser Gly 
2900 2905 2910 2915 

CGC TGT GTG GCT GAG GCA CTG CTC TGC AAC GGC CAG GAT GAC TGT GGC 
Arg Cys Val Ala Glu Ala Leu Leu Cys Asn Gly Gin Asp Asp Cys Gly 
2920 2925 2930 

GAC AGC TCG GAC GAG CGT GGC TGC CAC ATC AAT GAG TGT CTC AGC CGC 
Asp Ser Ser Asp Glu Arg Gly Cys His lie Asn Glu Cys Leu Ser Arg 
2935 2940 2945 

AAG CTC AGT GGC TGC AGC CAG GAC TGT GAG GAC CTC AAG ATC GGC TTC 
Lys Leu Ser Gly Cys Ser Gin Asp Cys Glu Asp Leu Lys lie Gly Phe 
2950 2955 2960 

AAG TGC CGC TGT CGC CCT GGC TTC CGG CTG AAG GAT GAC GGC CGG ACG 
Lys Cys Arg Cys Arg Pro Gly Phe Arg Leu Lys Asp Asp Gly Arg Thr 
2965 2970 2975 

TGT GCT GAT GTG GAC GAG TGC AGC ACC ACC TTC CCC TGC AGC CAG CGC 
Cys Ala Asp Val Asp Glu Cys Ser Thr Thr Phe Pro Cys Ser Gin Arg 
2980 2985 2990 2995 

TGC ATC AAC ACC CAT GGC AGC TAT AAG TGT CTG TGT GTG GAG GGC TAT 
Cys lie Asn Thr His Gly Ser Tyr Lys Cys Leu Cys Val Glu Gly Tyr 
3000 3005 3010 

GCA CCC CGC GGC GGC GAC CCC CAC AGC TGC AAG GCT GTG ACT GAC GAG 
Ala Pro Arg Gly Gly Asp Pro His Ser Cys Lys Ala Val Thr Asp Glu 
3015 3020 3025 

GAA CCG TTT CTG ATC TTC GCC AAC CGG TAC TAC CTG CGC AAG CTC AAC 
Glu Pro Phe Leu lie Phe Ala Asn Arg Tyr Tyr Leu Arg Lys Leu Asn 
3030 3035 3040 

CTG GAC GGG TCC AAC TAC ACG TTA CTT AAG CAG GGC CTG AAC AAC GCC 
Leu Asp Gly Ser Asn Tyr Thr Leu Leu Lys Gin Gly Leu Asn Asn Ala 
3045 3050 3055 

GTT GCC TTG GAT TTT GAC TAC CGA GAG CAG ATG ATC TAC TGG ACA GAT 
Val Ala Leu Asp Phe Asp Tyr Arg Glu Gin Met lie Tyr Trp Thr Asp 
3060 3065 3070 3075 
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GTG ACC ACC CAG GGC AGC ATG ATC CGA AGG ATG CAC CTT AAC GGG AGC 
Val Thr Thr Gin Gly Ser Met He Arg Arg Met His Leu Asn Gly Ser 
3080 3085 3090 

AAT GTG CAG GTC CTA CAC CGT ACA GGC CTC AGC AAC CCC GAT GGG CTG 
Asn Val Gin Val Leu His Arg Thr Gly Leu Ser Asn Pro Asp Gly Leu 
3095 3100 3105 

GCT GTG GAC TGG GTG GGT GGC AAC CTG TAC TGG TGC GAC AAA GGC CGG 
Ala Val Asp Trp Val Gly Gly Asn Leu Tyr Trp Cys Asp Lys Gly Arg 
3H0 3115 3120 

GAC ACC ATC GAG GTG TCC AAG CTC AAT GGG GCC TAT CGG ACG GTG CTG 
Asp Thr He Glu Val Ser Lys Leu Asn Gly Ala Tyr Arg Thr Val Leu 
3125 3130 3135 

GTC AGC TCT GGC CTC CGT GAG CCC AGG GCT CTG GTG GTG GAT GTG CAG 
Val Ser Ser Gly Leu Arg Glu Pro Arg Ala Leu Val Val Asp Val Gin 
3140 3145 3150 3155 

AAT GGG TAC CTG TAC TGG ACA GAC TGG GGT GAC CAT TCA CTG ATC GGC 
Asn Gly Tyr Leu Tyr Trp Thr Asp Trp Gly Asp His Ser Leu He Gly 
3160 3165 3170 

CGC ATC GGC ATG GAT GGG TCC AGC CGC AGC GTC ATC GTG GAC ACC AAG 
Arg He Gly Met Asp Gly Ser Ser Arg Ser Val He Val Asp Thr Lys 
3175 3180 3185 

ATC ACA TGG CCC AAT GGC CTG ACG CTG GAC TAT GTC ACT GAG CGC ATC 
lie Thr Trp Pro Asn Gly Leu Thr Leu Asp Tyr Val Thr Glu Arg He 
3190 3195 3200 

TAC TGG GCC GAC GCC CGC GAG GAC TAC ATT GAA TTT GCC AGC CTG GAT 
Tyr Trp Ala Asp Ala Arg Glu Asp Tyr He Glu Phe Ala Ser Leu Asp 
3205 3210 3215 

GGC TCC AAT CGC CAC GTT GTG CTG AGC CAG GAC ATC CCG CAC ATC TTT 
Gly Ser Asn Arg His Val Val Leu Ser Gin Asp lie Pro His lie Phe 
3220 3225 3230 3235 

GCA CTG ACC CTG TTT GAG GAC TAC GTC TAC TGG ACC GAC TGG GAA ACA 
Ala Leu Thr Leu Phe Glu Asp Tyr Val Tyr Trp Thr Asp Trp Glu Thr 
3240 3245 3250 

AAG TCC ATT AAC CGA GCC CAC AAG ACC ACG GGC ACC AAC AAA ACG CTC 
Lys Ser He Asn Arg Ala His Lys Thr Thr Gly Thr Asn Lys Thr Leu 
3255 3260 3265 

CTC ATC AGC ACG CTG CAC CGG CCC ATG GAC CTG CAT GTC TTC CAT GCC 
Leu He Ser Thr Leu His Arg Pro Met Asp Leu His Val Phe His Ala 
3270 3275 3280 

CTG CGC CAG CCA GAC GTG CCC AAT CAC CCC TGC AAG GTC AAC AAT GGT 
Leu Arg Gin Pro Asp Val Pro Asn His Pro Cys Lys Val Asn Asn Gly 
3285 3290 3295 

GGC TGC AGC AAC CTG TGC CTG CTG TCC CCC GGG GGA GGG CAC AAA TGT 
Gly Cys Ser Asn Leu Cys Leu Leu Ser Pro Gly Gly Gly His Lys Cys 
3300 3305 3310 3315 
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GCC TGC CCC ACC AAC TTC TAC CTG GGC AGC GAT GGG CGC ACC TGT GTG 104 59 
Ala Cys Pro Thr Asn Phe Tyr Leu Gly Ser Asp Gly Arg Thr Cys Val 
332 0 3325 3330 

TCC AAC TGC ACG GCT AGC CAG TTT GTA TGC AAG AAC GAC AAG TGC ATC 10507 
Ser Asn Cys Thr Ala Ser Gin Phe Val Cys Lys Asn Asp Lys Cys He 
3335 3340 3345 

CCC TTC TGG TGG AAG TGT GAC ACC GAG GAC GAC TGC GGG GAC CAC TCA 10555 
Pro Phe Trp Trp Lys Cys Asp Thr Glu Asp Asp Cys Gly Asp His Ser 
3350 3355 3360 

GAC GAG CCC CCG GAC TGC CCT GAG TTC AAG TGC CGG CCC GGA CAG TTC 10603 
Asp Glu Pro Pro Asp Cys Pro Glu Phe Lys Cys Arg Pro Gly Gin Phe 
3365 3370 3375 

CAG TGC TCC ACA GGT ATC TGC AC A AAC CCT GCC TTC ATC TGC GAT GGC 10651 
Gin Cys Ser Thr Gly He Cys Thr Asn Pro Ala Phe He Cys Asp Gly 
3380 3385 3390 3395 

GAC AAT GAC TGC CAG GAC AAC AGT GAC GAG GCC AAC TGT GAC ATC CAC 10699 
Asp Asn Asp Cys Gin Asp Asn Ser Asp Glu Ala Asn Cys Asp He His 
3400 3405 3410 

GTC TGC TTG CCC AGT CAG TTC AAA TGC ACC AAC ACC AAC CGC TGT ATT 107 4 7 
Val Cys Leu Pro Ser Gin Phe Lys Cys Thr Asn Thr Asn Arg Cys He 
3415 3420 3425 

CCC GGC ATC TTC CGC TGC AAT GGG CAG GAC AAC TGC GGA GAT GGG GAG 107 95 
Pro Gly He Phe Arg Cys Asn Gly Gin Asp Asn Cys Gly Asp Gly Glu 
3430 3435 3440 

GAT GAG AGG GAC TGC CCC GAG GTG ACC TGC GCC CCC AAC CAG TTC CAG 108 43 
Asp Glu Arg Asp Cys Pro Glu Val Thr Cys Ala Pro Asn Gin Phe Gin 
3445 3450 3455 

TGC TCC ATT ACC AAA CGG TGC ATC CCC CGG GTC TGG GTC TGC GAC CGG 10891 
Cys Ser He Thr Lys Arg Cys He Pro Arg Val Trp Val Cys Asp Arg 
3460 3465 3470 3475 

GAC AAT GAC TGT GTG GAT GGC AGT GAT GAG CCC GCC AAC TGC ACC CAG 10939 
Asp Asn Asp Cys Val Asp Gly Ser Asp Glu Pro Ala Asn Cys Thr Gin 
3480 3485 3490 

ATG ACC TGT GGT GTG GAC GAG TTC CGC TGC AAG GAT TCG GGC CGC TGC 10987 
Met Thr Cys Gly Val Asp Glu Phe Arg Cys Lys Asp Ser Gly Arg Cys 
3495 3500 3505 

ATC CCA GCG CGT TGG AAG TGT GAC GGA GAG GAT GAC TGT GGG GAT GGC 11035 
He Pro Ala Arg Trp Lys Cys Asp Gly Glu Asp Asp Cys Gly Asp Gly 
3510 3515 3520 

TCG GAT GAG CCC AAG GAA GAG TGT GAT GAA CGC ACC TGT GAG CCA TAC 11083 
Ser Asp Glu Pro Lys Glu Glu Cys Asp Glu Arg Thr Cys Glu Pro Tyr 
3525 3530 3535 

CAG TTC CGC TGC AAG AAC AAC CGC TGC GTG CCC GGC CGC TGG CAG TGC 11131 
Gin Phe Arg Cys Lys Asn Asn Arg Cys Val Pro Gly Arg Trp Gin Cys 
3540 3545 3550 3555 
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GAC TAC GAC AAC GAT TGC GGT GAC AAC TCC GAT GAA GAG AGC TGC ACC 1117 9 

Asp Tyr Asp Asn Asp Cys Gly Asp Asn Ser Asp Glu Glu Ser Cys Thr 
3560 3565 3570 

CCT CGG CCC TGC TCC GAG AGT GAG TTC TCC TGT GCC AAC GGC CGC TGC 11227 

Pro Arg Pro Cys Ser Glu Ser Glu Phe Ser Cys Ala Asn Gly Arq Cys 
3575 3580 3585 

ATC GCG GGG CGC TGG AAA TGC GAT GGA GAC CAC GAC TGC GCG GAC GGC 11275 

He Ala Gly Arg Trp Lys Cys Asp Gly Asp His Asp Cys Ala Asp Gly 
3590 3595 3600 

TCG GAC GAG AAA GAC TGC ACC CCC CGC TGT GAC ATG GAC CAG TTC CAG 11323 

Ser Asp Glu Lys Asp Cys Thr Pro Arg Cys Asp Met Asp Gin Phe Gin 
3605 3610 3615 

TGC AAG AGC GGC CAC TGC ATC CCC CTG CGC TGG CGC TGT GAC GCA GAC 11371 

Cys Lys Ser Gly His Cys He Pro Leu Arg Trp Arg Cys Asp Ala Asp 
3620 3625 3630 3635 

GCC GAC TGC ATG GAC GGC AGC GAC GAG GAG GCC TGC GGC ACT GGC GTG 11419 

Ala Asp Cys Met Asp Gly Ser Asp Glu Glu Ala Cys Gly Thr Gly Val 
3640 3645 3650 

CGG ACC TGC CCC CTG GAC GAG TTC CAG TGC AAC AAC ACC TTG TGC AAG 114 67 

Arg Thr Cys Pro Leu Asp Glu Phe Gin Cys Asn Asn Thr Leu Cys Lys 
3655 3660 3665 

CCG CTG GCC TGG AAG TGC GAT GGC GAG GAT GAC TGT GGG GAC AAC TCA 11515 

Pro Leu Ala Trp Lys Cys Asp Gly Glu Asp Asp Cys Gly Asp Asn Ser 
3670 3675 3680 

GAT GAG AAC CCC GAG GAG TGT GCC CGG TTC GTG TGC CCT CCC AAC CGG 11563 

Asp Glu Asn Pro Glu Glu Cys Ala Arg Phe Val Cys Pro Pro Asn Arg 
3685 3690 3695 

CCC TTC CGT TGC AAG AAT GAC CGC GTC TGT CTG TGG ATC GGG CGC CAA 11611 . 

Pro Phe Arg Cys Lys Asn Asp Arg Val Cys Leu Trp He Gly Arg Gin 
3700 3705 3710 3715 

TGC GAT GGC ACG GAC AAC TGT GGG GAT GGG ACT GAT GAA GAG GAC TGT 11659 

Cys Asp Gly Thr Asp Asn Cys Gly Asp Gly Thr Asp Glu Glu Asp Cys 
3720 3725 3730 

GAG CCC CCC ACA GCC CAC ACC ACC CAC TGC AAA GAC AAG AAG GAG TTT 11707 

Glu Pro Pro Thr Ala His Thr Thr His Cys Lys Asp Lys Lys Glu Phe 
3735 3740 3745 

CTG TGC CGG AAC CAG CGC TGC CTC TCC TCC TCC CTG CGC TGC AAC ATG 11755 

Leu Cys Arg Asn Gin Arg Cys Leu Ser Ser Ser Leu Arg Cys Asn Met 
3750 3755 3760 

TTC GAT GAC TGC GGG GAC GGC TCT GAC GAG GAG GAC TGC AGC ATC GAC 11803 

Phe Asp Asp Cys Gly Asp Gly Ser Asp Glu Glu Asp Cys Ser He Asp 
3765 3770 3775 

CCC AAG CTG ACC AGC TGC GCC ACC AAT GCC AGC ATC TGT GGG GAC GAG 11851 

Pro Lys Leu Thr Ser Cys Ala Thr Asn Ala Ser He Cys Gly Asp Glu 
3780 3785 3790 3795 
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GCA CGC TGC GTG CGC ACC GAG AAA GCG GCC TAC TGT GCC TGC CGC TCG 
Ala Arg Cys Val Arg Thr Glu Lys Ala Ala Tyr Cys Ala Cys Arg Ser 
3800 3805 3810 

GGC TTC CAC ACC GTG CCC GGC CAG CCC GGA TGC CAA GAC ATC AAC GAG 
Gly Phe His Thr Val Pro Gly Gin Pro Gly Cys Gin Asp He Asn Glu 
3815 3820 3825 

TGC CTG CGC TTC GGC ACC TGC TCC CAG CTC TGC AAC AAC ACC AAG GGC 
Cys Leu Arg Phe Gly Thr Cys Ser Gin Leu Cys Asn Asn Thr Lys Gly 
3830 3835 3840 

GGC CAC CTC TGC AGC TGC GCT CGG AAC TTC ATG AAG ACG CAC AAC ACC 
Gly His Leu Cys Ser Cys Ala Arg Asn Phe Met Lys Thr His Asn Thr 
3845 3850 3855 

TGC AAG GCC GAA GGC TCT GAG TAC CAG GTC CTG TAC ATC GCT GAT GAC 
Cys Lys Ala Glu Gly Ser Glu Tyr Gin Val Leu Tyr He Ala Asp Asp 
3860 3865 3870 3875 

AAT GAG ATC CGC AGC CTG TTC CCC GGC CAC CCC CAT TCG GCT TAC GAG 
Asn Glu He Arg Ser Leu Phe Pro Gly His Pro His Ser Ala Tyr Glu 
3880 3885 3890 

CAG GCA TTC CAG GGT GAC GAG AGT GTC CGC ATT GAT GCT ATG GAT GTC 
Gin Ala Phe Gin Gly Asp Glu Ser Val Arg He Asp Ala Met Asp Val 
3895 3900 3905 

CAT GTC AAG GCT GGC CGT GTC TAT TGG ACC AAC TGG CAC ACG GGC ACC 
His Val Lys Ala Gly Arg Val Tyr Trp Thr Asn Trp His Thr Gly Thr 
3910 3915 3920 

ATC TCC TAC CGC AGC CTG CCA CCT GCT GCG CCT CCT ACC ACT TCC AAC 
He Ser Tyr Arg Ser Leu Pro Pro Ala Ala Pro Pro Thr Thr Ser Asn 
3925 " 3930 3935 

CGC CAC CGG CGA CAG ATT GAC CGG GGT GTC ACC CAC CTC AAC ATT TCA 
Arg His Arg Arg Gin He Asp Arg Gly Val Thr His Leu Asn He Ser 
3940 3945 3950 3955 

GGG CTG AAG ATG CCC AGA GGC ATC GCC ATC GAC TGG GTG GCC GGA AAC 
Gly Leu Lys Met Pro Arg Gly He Ala He Asp Trp Val Ala Gly Asn 
3960 3965 3970 

GTG TAC TGG ACC GAC TCG GGC CGA GAT GTG ATT GAG GTG GCG CAG ATG 
Val Tyr Trp Thr Asp Ser Gly Arg Asp Val He Glu Val Ala Gin Met 
3975 * 3980 3985 

AAG GGC GAG AAC CGC AAG ACG CTC ATC TCG GGC ATG ATT GAC GAG CCC 
Lys Gly Glu Asn Arg Lys Thr Leu He Ser Gly Met He Asp Glu Pro 
3990 ' 3995 4000 

CAC GCC ATT GTG GTG GAC CCA CTG AGG GGG ACC ATG TAC TGG TCA GAC 
His Ala He Val Val Asp Pro Leu Arg Gly Thr Met Tyr Trp Ser Asp 
4005 4010 4015 

TGG GGC AAC CAC CCC AAG ATT GAG ACG GCA GCG ATG GAT GGG ACG CTT 
Trp Gly Asn His Pro Lys He Glu Thr Ala Ala Met Asp Gly Thr Leu 
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CGG GAG ACA CTG GTG CAG GAC AAC ATT CAG TGG CCC AC A GGC CTG GCC 
Arg Glu Thr Leu Val Gin Asp Asn lie Gin Trp Pro Thr Gly Leu Ala 
4040 4045 " 4050 

GTG GAT TAT CAC AAT GAG CGG CTG TAC TGG GCA GAC GCC AAG CTT TCA 
Val Asp Tyr His Asn Glu Arg Leu Tyr Trp Ala Asp Ala Lys Leu Ser 
4055 ' 4060 " 4065 

GTC ATC GGC AGC ATC CGG CTC AAT GGC ACG GAC CCC ATT GTG GCT GCT 
Val lie Gly Ser lie Arg Leu Asn Gly Thr Asp Pro He Val Ala Ala 
4070 4075 4080 

GAC AGC AAA CGA GGC CTA AGT CAC CCC TTC AGC ATC GAC GTC TTT GAG 
Asp Ser Lys Arg Gly Leu Ser His Pro Phe Ser He Asp Val Phe Glu 
4085 4090 4095 

GAT TAC ATC TAT GGT GTC ACC TAC ATC AAT AAT CGT GTC TTC AAG ATC 
Asp Tyr He Tyr Gly Val Thr Tyr He Asn Asn Arg Val Phe Lys lie 
4100 4105 4110 4115 

CAT AAG TTT GGC CAC AGC CCC TTG GTC AAC CTG ACA GGG GGC CTG AGC 
His Lys Phe Gly His Ser Pro Leu Val Asn Leu Thr Gly Gly Leu Ser 
4120 4125 4130 

CAC GCC TCT GAC GTG GTC CTT TAC CAT CAG CAC AAG CAG CCC GAA GTG 
His Ala Ser Asp Val Val Leu Tyr His Gin His Lys Gin Pro Glu Val 
4135 4140 4145 

ACC AAC CCA TGT GAC CGC AAG AAA TGC GAG TGG CTC TGC CTG CTG AGC 
Thr Asn Pro Cys Asp Arg Lys Lys Cys Glu Trp Leu Cys Leu Leu Ser 
4150 4155 4160 

CCC AGT GGG CCT GTC TGC ACC TGT CCC AAT GGG AAG CGG CTG GAC AAC 
Pro Ser Gly Pro Val Cys Thr Cys Pro Asn Gly Lvs Arg Leu Asp Asn 
4165 ' 4170 4175 

GGC ACA TGC GTG CCT GTG CCC TCT CCA ACG CCC CCC CCA GAT GCT CCC 
Gly Thr Cys Val Pro Val Pro Ser Pro Thr Pro Pro Pro Asp Ala Pro 
4180 4185 4190 4195 

CGG CCT GGA ACC TGT AAC CTG CAG TGC TTC AAC GGT GGC AGC TGT TTC 
Arg Pro Gly Thr Cys Asn Leu Gin Cys Phe Asn Gly Gly Ser Cys Phe 
4200 4205 ~ 4210 

CTC AAT GCA CGG AGG CAG CCC AAG TGC CGC TGC CAA CCC CGC TAC ACG 
Leu Asn Ala Arg Arg Gin Pro Lys Cys Arg Cys Gin Pro Arg Tyr Thr 
4215 4220 4225 

GGT GAC AAG TGT GAA CTG GAC CAG TGC TGG GAG CAC TGT CGC AAT GGG 
Gly Asp Lys Cys Glu Leu Asp Gin Cys Trp Glu His Cys Arg Asn Gly 
4230 4235 4240 

GGC ACC TGT GCT GCC TCC CCC TCT GGC ATG CCC ACG TGC CGG TGC CCC 
Gly Thr Cys Ala Ala Ser Pro Ser Gly Met Pro Thr Cys Arg Cys Pro 
4245 4250 4255 

ACG GGC TTC ACG GGC CCC AAA TGC ACC CAG CAG GTG TGT GCG GGC TAC 
Thr Gly Phe Thr Gly Pro Lys Cys Thr Gin Gin Val Cys Ala Gly Tyr 
4260 " 4265 4270 4275 
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TGT GCC AAC AAC AGC ACC TGC ACT GTC AAC CAG GGC AAC CAG CCC CAG 
Cys Ala Asn Asn Ser Thr Cys Thr Val Asn Gin Gly Asn Gin Pro Gin 
4280 4285 4290 

TGC CGA TGC CTA CCC GGC TTC CTG GGC GAC CGC TGC CAG TAC CGG CAG 
Cys Arg Cys Leu Pro Gly Phe Leu Gly Asp Arg Cys Gin Tyr Arg Gin 
4295 4300 4305 

TGC TCT GGC TAC TGT GAG AAC TTT GGC ACA TGC CAG ATG GCT GCT GAT 
Cys Ser Gly Tyr Cys Glu Asn Phe Gly Thr Cys Gin Met Ala Ala Asp 
4310 4315 4320 

GGC TCC CGA CAA TGC CGC TGC ACT GCC TAC TTT GAG GGA TCG AGG TGT 
Gly Ser Arg Gin Cys Arg Cys Thr Ala Tyr Phe Glu Gly Ser Arg Cys 
4325 4330 4335 

GAG GTG AAC AAG TGC AGC CGC TGT CTC GAA GGG GCC TGT GTG GTC AAC 
Glu Val Asn Lys Cys Ser Arg Cys Leu Glu Gly Ala Cys Val Val Asn 
4340 4345 4350 4355 

AAG CAG AGT GGG GAT GTC ACC TGC AAC TGC ACG GAT GGC CGG GTG GCC 
Lys Gin Ser Gly Asp Val Thr Cys Asn Cys Thr Asp Gly Arg Val Ala 
4360 4365 * 4370 

CCC AGC TGT CTG ACC TGC GTC GGC CAC TGC AGC AAT GGC GGC TCC TGT 
Pro Ser Cys Leu Thr Cys Val Gly His Cys Ser Asn Gly Gly Ser Cys 
4375 4380 4385 

ACC ATG AAC AGC AAA ATG ATG CCT GAG TGC CAG TGC CCA CCC CAC ATG 
Thr Met Asn Ser Lys Met Met Pro Glu Cys Gin Cys Pro Pro His Met 
4390 4395 4400 

ACA GGG CCC CGG TGT GAG GAG CAC GTC TTC AGC CAG CAG CAG CCA GGA 
Thr Gly Pro Arg Cys Glu Glu His Val Phe Ser Gin Gin Gin Pro Gly 
4405 4410 4415 

CAT ATA GCC TCC ATC CTA ATC CCT CTG CTG TTG CTG CTG CTG CTG GTT 
His lie Ala Ser lie Leu lie Pro Leu Leu Leu Leu Leu Leu Leu Val 
4420 4425 4430 4435 

CTG GTG GCC GGA GTG GTA TTC TGG TAT AAG CGG CGA GTC CAA GGG GCT 
Leu Val Ala Gly Val Val Phe Trp Tyr Lys Arg Arg Val Gin Gly Ala 
4440 4445 " 4450 

AAG GGC TTC CAG CAC CAA CGG ATG ACC AAC GGG GCC ATG AAC GTG GAG 
Lys Gly Phe Gin His Gin Arg Met Thr Asn Gly Ala Met Asn Val Glu 
4455 4460 4465 

ATT GGA AAC CCC ACC TAC AAG ATG TAC GAA GGC GGA GAG CCT GAT GAT 
lie Gly Asn Pro Thr Tyr Lys Met Tyr Glu Gly Gly Glu Pro Asp Asp 
4470 4475 4480 

GTG GGA GGC CTA CTG GAC GCT GAC TTT GCC CTG GAC CCT GAC AAG CCC 
Val Gly Gly Leu Leu Asp Ala Asp Phe Ala Leu Asp Pro Asp Lys Pro 
4485 ' 4490 * 4495 

ACC AAC TTC ACC AAC CCC GTG TAT GCC ACA CTC TAC ATG GGG GGC CAT 
Thr Asn Phe Thr Asn Pro Val Tyr Ala Thr Leu Tyr Met Gly Gly His 
4500 4505 4510 ~ 4515 
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GGC AGT CGC CAC TCC CTG GCC AGC ACG GAC GAG AAG CGA GAA CTC CTG 14 059 
Gly Ser Arg His Ser Leu Ala Ser Thr Asp Glu Lys Arg Glu Leu Leu 
4520 4525 ' 4530 

GGC CGG GGC CCT GAG GAC GAG ATA GGG GAC CCC TTG GCA TAGGGCCCTG CC 14110 

CCGTCGGACT GCCCCCAGAA AGCCTCCTGC CCCCTGCCGG TGAAGTCCTT CAGTGAGCCC 14170 
Gly Arg Gly Pro Glu Asp Glu lie Gly Asp Pro Leu Ala 
4535 4540 

CTCCCCAGCC AGCCCTTCCC TGGCCCCGCC GGATGTATAA ATGTAAAAAT GAAGGAATTA 14 230 

CATTTTATAT GTGAGCGAGC AAGCCGGCAA GCGAGCACAG TATTATTTCT CCATCCCCTC 14290 

CCTGCCTGCT CCTTGGCACC CCCATGCTGC CTTCAGGGAG ACAGGCAGGG AGGGCTTGGG 14350 

GCTGCACCTC CTACCCTCCC ACCAGAACGC ACCCCACTGG GAGAGCTGGT GGTGCAGCCT 14 410 

TCCCCTCCCT G TAT AAG AC A CTTTGCCAA3 GCTCTCCCCT CTCGCCCCAT CCCTGCTTGC 14 470 

CCGCTCCCAC AGCTTCCTGA GGGCTAATTC TGGGAAGGGA GAGTTCTTTG CTGCCCCTGT 14530 

CTGGAAGACG TGGCTCTGGG TGAGGTAGGC GGGAAAGGAT GGAGTGTTTT AGTTCTTGGG 14 590 

GGAGGCCACC CCAAACCCCA GCCCCAACTC CAGGGGCACC TATGAGATGG CCATGCTCAA 14 650 

CCCCCCTCCC AGACAGGCCC TCCCTGTCTC CAGGGCCCCC ACCGAGGTTC CCAGGGCTGG 14 710 

AGACTTCCTC TGGTAAACAT TCCTCCAGCC TCCCCTCCCC TGGGGACGCC AAGGAGGTGG 14770 

GCCACACCCA GGAAGGGAAA GCGGGCAGCC CCGTTTTGGG GACGTGAACG TTTTAATAAT 14830 

TTTTGCTGAA TTCTTTACAA CTAAATAACA CAGATATTCT TATAAATAAA ATTGTAAAAA 14890 

AAAAAA 14896 
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Met Leu Thr Pro Pro Le u Leu Leu Leu Leu Pro L eu Leu Ser Ala Leu 

Val Ala Aia Ala He Asp Ala Pro Lys'iThf -Cys "Ser : Pro Lys Gln^Phe 

• ^ • • - 20 - v ; ; 25. ' ." ... 30 

, Ala'5Cys; Arg Asp iGln.'l-lp|Thr.: Cys>IleTser7Lys Gly .'-Trp Arg Cys*Asp 

<K ••'-35 • . 40 45- " V.>Si#i-. 




Asj C> 1 T st Cl> S.ei . Asp G a Glyg-Prci I 1 Cyfe^Arc Glu "cu Gin 

iooT ' * 105'"" *" ' " "no 

Gly Asn Cys Ser Arg Leu Gly Cys Gin His His Cys Val Pro Thr Leu 

115 120 125 

Asp Gly Pro Thr Cys Tyr Cys Asn Ser Ser Phe Gin Leu Gin Ala Asp 

130 135 140 

Gly Lys Thr Cys Lys Asp Phe Asp Glu Cys Ser Val Tyr Gly Thr Cys 
145 150 155 " 160 

Ser Gin Leu Cys Thr Asn Thr Asp Gly Ser Phe He Cys Gly Cys Val 

165 170 175 

Glu Gly Tyr Leu Leu Gin Pro Asp Asn Arg Ser Cys Lys Ala Lys Asn 

180 185 " " 190 

Glu Pro Val Asp Arg Pro Pro Val Leu Leu He Ala Asn Ser Gin Asn 

195 200 205 

He Leu Ala Thr Tyr Leu Ser Gly Ala Gin Val Ser Thr He Thr Pro 

210 215 220 

Thr Ser Thr Arg Gin Thr Thr Ala Met Asp Phe Ser Tyr Ala Asn Glu 
225 230 235 240 

Thr Val Cys Trp Val His Val Gly Asp Ser Ala Ala Gin Thr Gin Leu 

245 250 255 

Lys Cys Ala Arg Met Pro Gly Leu Lys Gly Phe Val Asp Glu His Thr 

260 265 270 

lie Asn He Ser Leu Ser Leu His His Val Glu Gin Met Ala He Asp 

275 280 285 

Trp Leu Thr Gly Asn Phe Tyr Phe Val Asp Asp He Asp Asp Arg He 

290 295 300 

Phe Val Cys Asn Arg Asn Gly Asp Thr Cys Val Thr Leu Leu Asp Leu 
305 * 310 315 320 

Glu Leu Tyr Asn Pro Lys Gly lie Ala Leu Asp Pro Ala Met Gly Lys 

325 330 335 

Val Phe Phe Thr Asp Tyr Gly Gin lie Pro Lys Val Glu Arg Cys Asp 

340 345 350 

Met Asp Gly Gin Asn Arg Thr Lys Leu Val Asp Ser Lys He Val Phe 

355 ~ 360 365 

Pro His Gly He Thr Leu Asp Leu Val Ser Arg Leu Val Tyr Trp Ala 

370 375 380 

Asp Ala Tyr Leu Asp Tyr He Glu Val Val Asp Tyr Glu Gly Lys Gly 
385 390 395 400 

Arg Gin Thr He He Gin Gly He Leu He Glu His Leu Tyr Gly Leu 

405 410 415 

Thr Val Phe Glu Asn Tyr Leu Tyr Ala Thr Asn Ser Asp Asn Ala Asn 

420 425 430 

Ala Gin Gin Lys Thr Ser Val He Arg Val Asn Arg Phe Asn Ser Thr 

435 440 445 

Glu Tyr Gin Val Val Thr Arg Val Asp Lys Gly Gly Ala Leu His lie 
450 455 ' 460 
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Tyr His Gin Arg Arg Gin Pro Arg Val Arg Ser His Ala Cys Glu Asn 
465 470 475 480 

Asp Gin Tyr Gly Lys Pro Gly Gly Cys Ser Asp He Cys Leu Leu Ala 

485 490 495 

Asn Ser His Lys Ala Arg Thr Cys Arg Cys Arg Ser Gly Phe Ser Leu 

500 505 " 510 

Gly Ser Asp Gly Lys Ser Cys Lys Lys Pro Glu His Glu Leu Phe Leu 

515 520 525 

Val Tyr Gly Lys Gly Arg Pro Gly He He Arg Gly Met Asp Met Gly 

530 535 540 

Ala Lys Val Pro Asp Glu His Met He Pro He Glu Asn Leu Met Asn 
545 550 555 560 

Pro Arg Ala Leu Asp Phe His Ala Glu Thr Gly Phe He Tyr Phe Ala 

565 570 575 

Asp Thr Thr Ser Tyr Leu He Gly Arg Gin Lys He Asp Gly Thr Glu 

580 585 ' 590 

Arg Glu Thr He Leu Lys Asp Gly He His Asn Val Glu Gly Val Ala 

595 600 605 

Val Asp Trp Met Gly Asp Asn Leu Tyr Trp Thr Asp Asp Gly Pro Lys 

610 615 " 620 

Lys Thr He Ser Val Ala Arg Leu Glu Lys Ala Ala Gin Thr Arg Lys 
625 • 630 635 640 

Thr Leu He Glu Gly Lys Met Thr His Pro Arg Ala He Val Val Asp 

645 650 655 

Pro Leu Asn Gly Trp Met Tyr Trp Thr Asp Trp Glu Glu Asp Pro Lys 

660 665 670 

Asp Ser Arg Arg Gly Arg Leu Glu Arg Ala TrD Met Asp Gly Ser His 

675 680 685 

Arg Asp He Phe Val Thr Ser Lys Thr Val Leu Trp Pro Asn Gly Leu 

690 695 700 

Ser Leu Asp He Pro Ala Gly Arg Leu Tyr Trp Val Asp Ala Phe Tyr 
705 710 715 720 

Asp Arg He Glu Thr He Leu Leu Asn Gly Thr Asp Arg Lys He Val 

725 730 735 

Tyr Glu Gly Pro Glu Leu Asn His Ala Phe Gly Leu Cys His His Gly 

740 745 750 

Asn Tyr Leu Phe Trp Thr Glu Tyr Arg Ser Gly Ser Val Tyr Arg Leu 

755 760 765 

Glu Arg Gly Val Gly Gly Ala Pro Pro Thr Val Thr Leu Leu Arg Ser 

770 " 775 780 

Glu Arg Pro Pro He Phe Glu He Arg Met Tyr Asp Ala Gin Gin Gin 
785 ' 790 795 800 

Gin Val Gly Thr Asn Lys Cys Arg Val Asn Asn Gly Gly Cys Ser Ser 

805 810 815 

Leu Cys Leu Ala Thr Pro Gly Ser Arg Gin Cys Ala Cys Ala Glu Asp 

820 825 ' 830 

Gin Val Leu Asp Ala Asp Gly Val Thr Cys Leu Ala Asn Pro Ser Tyr 
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Val Pro Pro Pxo Gl n ' Cys CI n %Prc ? CI y ^3jlu PI e^Alc 
85U J , . % 855^-> £<f ' ''-'860 



''Arg Cys lie- 
•865 



Cys 'Ala^As^^er 
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aUgr-S^Mita ar Cys p„ Pro , As „ a „ >A . s „ Cys - ikla 

Ser Gly ^ Cys rji >„ T £ ^ jg--^ ,^ ^ ^jgg 
cys Sly , sp ,^ g ^ Asp tf tt :.&\i*.&-™?$^ Pro 
Pto ^ ^ xg^lg^^ iis.cys.Tle. 

JJP P» Ser Val Lys Ph Gly Cys:Lys Asp £ 
1110 ■ , 



^ «a Trp v« eg Asp Giy Asp ; Cyi CJu ser l»» 

SIU Gi " Glu Ser ie » ^fffto £^ to Hi/pS 

toig Th, m Cys.^ MP.te^ l^ 

Kis Asn Cys 
1195 

o Leu Glv Met 

Asp Asn »i. iS r-^ Gin IX. GlnJ.^y! c y i Ala Lys L ys 
Cys ^ L y s Cy5 Asp B1 „ Lys phe ^1230 c ^ 

ryr cij Gly Tr P v.i Le„ oiu p„ Asp 01y Glu ser ^ flrg Ser Leu 

Asp P„ Phe L y s Pro^Phe He He Phe Ser Asn Ar 9 ° His Glu He Ar g 
Arg U. Asp LeuMs L y s Gly Asp Iyr a.™ Leu Val pro 01y ^J 
Arg Asn Tnr^le ala Leu flsp Ph. hi/SJ ser Gln Ser Ala^u T y r 
Trp Thr^sp val Val Glu Asp L y sHe T y r Arg Gly L^u L eu Asp 



lit S " SS " ^ C ^ S« Asn cys ser Va! A la Pro 

»ly Glu Gly II. v.! Cys Ser C ys Pro L.^Gl'y „e t Gl u leu Gly^ 



As^Gly Ala Leu Ihr Ser^he Glu Val Val He a, 1 ^ Gly Leu Ala 
Thr Pro Glu Gl y Leu Ma Val Asp Trp He Ala"" Asn He T y r Trp 
V,l Glu ser Ash^eu Asp Gin He Glu Val^la Lys Leu Asp Gl y "hr 
Leu Ar 9 Thr^hr Leu Leu Ala Gly Aspil" Glu His Pro ArgMa He 
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Ala Leu Asp Pro Arg Asp Gly lie Leu Phe Trp Thr Asp Trp Asp Ala 

1395 1400 ' 1405 

Ser Leu Pro Arg He Glu Ala Ala Ser Met Ser Gly Ala Gly Arg Arg 

1410 1415 1420 

Thr Val His Arg Glu Thr Gly Ser Gly Gly Trp Pro Asn Gly Leu Thr 
425 1430 1435 1440 

Val Asp Tyr Leu Glu Lys Arg He Leu Trp He Asp Ala Arg Ser Asp 

1445 1450 1455 

Ala He Tyr Ser Ala Arg Tyr Asp Gly Ser Gly His Met Glu Val Leu 

1460 1465 1470 

Arg Gly His Glu Phe Leu Ser His Pro Phe Ala Val Thr Leu Tyr Gly 

1475 1480 1485 

Gly Glu Val Tyr Trp Thr Asp Trp Arg Thr Asn Thr Leu Ala Lys Ala 

1490 1495 " 1500 

Asn Lys Trp Thr Gly His Asn Val Thr Val Val Gin Arg Thr Asn Thr 
505 * 1510 1515 1520 

Gin Pro Phe Asp Leu Gin Val Tyr His Pro Ser Arg Gin Pro Met Ala 

1525 " 1530 1535 

Pro Asn Pro Cys Glu Ala Asn Gly Gly Gin Gly Pro Cys Ser His Leu 

1540 1545 1550 

Cys Leu He Asn Tyr Asn Arg Thr Val Ser Cys Ala Cys Pro His Leu 

1555 1560 1565 

Met Lys Leu His Lys Asp Asn Thr Thr Cys Tyr Glu Phe Lys Lys Phe 

1570 1575 1580 

Leu Leu Tyr Ala Arg Gin Met Glu He Arg Gly Val Asp Leu Asp Ala 
585 " 1590 1595 1600 

Pro Tyr Tyr Asn Tyr He He Ser Phe Thr Val Pro Asp He Asp Asn 

1605 1610 1615 

Val Thr Val Leu Asp Tyr Asp Ala Arg Glu Gin Arg Val Tyr Trp Ser 

1620 * ' 1625 " 1630 

Asp Val Arg Thr Gin Ala He Lys Arg Ala Phe He Asn Gly Thr Gly 

1635 1640 ' 1645 

Val Glu Thr Val Val Ser Ala Asp Leu Pro Asn Ala His Gly Leu Ala 

1650 1655 1660 

Val Asp Trp Val Ser Arg Asn Leu Phe Trp Thr Ser Tyr Asp Thr Asn 
665 * 1670 1675 1680 

Lys Lys Gin He Asn Val Ala Arg Leu Asp Gly Ser Phe Lys Asn Ala 

1685 1690 1695 

Val Val Gin Gly Leu Glu Gin Pro His Gly Leu Val Val His Pro Leu 

1700 1705 1710 

Arg Gly Lys Leu Tyr Trp Thr Asp Gly Asp Asn He Ser Met Ala Asn 

1715 1720 1725 

Met Asp Gly Ser Asn Arg Thr Leu Leu Phe Ser Gly Gin Lys Gly Pro 

1730 1735 1740 

Val Gly Leu Ala He Asp Phe Pro Glu Ser Lys Leu Tyr Trp He Ser 
745 " 1750 1755 1760 

Ser Gly Asn His Thr He Asn Arg Cys Asn Leu Asp Gly Ser Gly Leu 

1765 1770 1775 

Glu Val He Asp Ala Met Arg Ser Gin Leu Gly Lys Ala Thr Ala Leu 

1780 1785 1790 

Ala He Met Gly Asp Lys Leu Trp Trp Ala Asp Gin Val Ser Glu Lys 

1795 1800 • 1805 

Met Gly Thr Cys Ser Lys Ala Asp Gly Ser Gly Ser Val Val Leu Arg 

1810 1815 - 1820 

Asn Ser Thr Thr Leu Val Met His Met Lys Val Tyr Asp Glu Ser He 
825 1830 1835 1840 

Gin Leu Asp His Lys Gly Thr Asn Pro Cys Ser Val Asn Asn Gly Asp 

1845 ' 1850 1855 

Cys Ser Gin Leu Cys Leu Pro Thr Ser Glu Thr Thr Arg Ser Cys Met 
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1865 



1870 



Cys Thr Ala Gly Tyr Ser Leu Arg Ser Gly Gin Gin Ala Cys Glu Gly 

1875 1880 1885 

Val Gly Ser Phe Leu Leu Tyr Ser Val His Glu Gly lie Arg Gly lie 

1890 1895 1900 

Pro Leu Asp Pro Asn Asp Lys Ser Asp Ala Leu Val Pro Val Ser Gly 
905 1910 1915 1920 

Thr Ser Leu Ala Val Gly He Asp Phe His Ala Glu Asn Asp Thr He 

1925 1930 1935 

Tyr Trp Val Asp Met Gly Leu Ser Thr He Ser Arg Ala Lys Arg Asp 

1940 1945 " 1950 

Gin Thr Trp Arg Glu Asp Val Val Thr Asn Gly He Gly Arg Val Glu 

1955 1960 1965 

Gly He Ala Val Asp Trp He Ala Gly Asn He Tyr Trp Thr Asp Gin 

1970 1975 1980 

Gly Phe Asp Val He Glu Val Ala Arg Leu Asn Gly Ser Phe Arg Tyr 
985 1990 1995 " 2000 

Val Val He Ser Gin Gly Leu Asp Lys Pro Arg Ala lie Thr Val His 

2005 2010 2015 

Pro Glu Lys Gly Tyr Leu Phe Trp Thr Glu Trp Gly Gin Tyr Pro Arg 

2020 2025 2030 

lie Glu Arg Ser Arg Leu Asp Gly Thr Glu Arg Val Val Leu Val Asn 

2035 2040 " 2045 

Val Ser He Ser Trp Pro Asn Gly He Ser Val Asp Tyr Gin Asp Gly 

2050 2055 2060 

Lys Leu Tyr Trp Cys Asp Ala Arg Thr Asp Lys He Glu Arg He Asp 
065 2070 2075 ' 2080 

Leu Glu Thr Gly Glu Asn Arg Glu Val Val Leu Ser Ser Asn Asn Met 

2085 2090 2095 

Asp Met Phe Ser Val Ser Val Phe Glu Asp Phe He Tyr Trp Ser Asp 

2100 2105 2110 

Arg Thr His Ala Asn Gly Ser He Lys Arg Gly Ser Lys Asp Asn Ala 

2115 2120 2125 

Thr Asp Ser Val Pro Leu Arg Thr Gly He Gly Val Gin Leu Lys Asp 

2130 2135 2140 

He Lys Val Phe Asn Arg Asp Arg Gin Lys Gly Thr Asn Val Cys Ala 
145 2150 2155 2160 

Val Ala Asn Gly Gly Cys Gin Gin Leu Cys Leu Tyr Arg Gly Arg Gly 

2165 2170 ' 2175 

Gin Arg Ala Cys Ala Cys Ala His Gly Met Leu Ala Glu Asp Gly Ala 

2180 2185 2190 

Ser Cys Arg Glu Tyr Ala Gly Tyr Leu Leu Tyr Ser Glu Arg Thr He 

2195 2200 2205 

Leu Lys Ser He His Leu Ser Asp Glu Arg Asn Leu Asn Ala Pro Val 

2210 2215 2220 

Gin Pro Phe Glu Asp Pro Glu His Met Lys Asn Val He Ala Leu Ala 
225 2230 2235 2240 

Phe Asp Tyr Arg Ala Gly Thr Ser Pro Gly Thr Pro Asn Arg He Phe 

2245 2250 2255 

Phe Ser Asp He His Phe Gly Asn He Gin Gin He Asn Asp Asp Gly 

2260 2265 2270 

Ser Arg Arg He Thr He Val Glu Asn Val Gly Ser Val Glu Gly Leu 

2275 2280 2285 

Ala Tyr His Arg Gly Trp Asp Thr Leu Tyr Trp Thr Ser Tyr Thr Thr 

2290 2295 " 2300 

Ser Thr He Thr Arg His Thr Val Asp Gin Thr Arg Pro Gly Ala Phe 
305 2310 2315 " 2320 

Glu Arg Glu Thr Val He Thr Met Ser Gly Asp Asp His Pro Arg Ala 



2325 



2330 



2335 
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Phe Val Leu Asp Glu Cys Gin Asn Leu Met Phe Trp Thr Asn Trp Asn 

2340 2345 ~ 2350 

Glu Gin His Pro Ser lie Met Arg Ala Ala Leu Ser Gly Ala Asn Val 

2355 2360 2365 

Leu Thr Leu lie Glu Lys Asp lie Arg Thr Pro Asn Gly Leu Ala He 

2370 2375 " 2380 

Asp His Arg Ala Glu Lys Leu Tyr Phe Ser Asp Ala Thr Leu Asp Lys 
385 2390 2395 2400 

He Glu Arg Cys Glu Tyr Asp Gly Ser His Arg Tyr Val He Leu Lys 

2405 2410 ' 2415 

Ser Glu Pro Val His Pro Phe Gly Leu Ala Val Tyr Gly Glu His He 

2420 2425 2430 

Phe Trp Thr Asp Trp Val Arg Arg Ala Val Gin Arg Ala Asn Lys His 

2435 2440 2445 

Val Gly Ser Asn Met Lys Leu Leu Arg Val Asp He Pro Gin Gin Pro 

2450 2455 2460 

Met Gly He He Ala Val Ala Asn Asp Thr Asn Ser Cys Glu Leu Ser 
465 2470 2475 2480 

Pro Cys Arg He Asn Asn Gly Gly Cys Gin Asp Leu Cys Leu Leu Thr 

2485 2490 * 2495 

His Gin Gly His Val Asn Cys Ser Cys Arg Gly Gly Arg He Leu Gin 

2500 2505 2510 

Asp Asp Leu Thr Cys Arg Ala Val Asn Ser Ser Cys Arg Ala Gin Asp 

2515 2520 2525 

Glu Phe Glu Cys Ala Asn Gly Glu Cys He Asn Phe Ser Leu Thr Cys 

2530 2535 2540 

Asp Gly Val Pro His Cys Lys Asp Lys Ser Asp Glu Lys Pro Ser Tyr 
545 2550 2555 2560 

Cys Asn Ser Arg Arg Cys Lys Lys Thr Phe Arg Gin Cys Ser Asn Gly 

2565 2570 2575 

Arg Cys Val Ser Asn Met Leu Trp Cys Asn Gly Ala Asp Asp Cys Gly 

2580 2585 2590 

Asp Gly Ser Asp Glu He Pro Cys Asn Lys Thr Ala Cys Gly Val Gly 

2595 2600 2605 

Glu Phe Arg Cys Arg Asp Gly Thr Cys He Gly Asn Ser Ser Arg Cys 

2610 2615 2620 

Asn Gin Phe Val Asp Cys Glu Asp Ala Ser Asp Glu Met Asn Cys Ser 
625 2630 2635 2640 

Ala Thr Asp Cys Ser Ser Tyr Phe Arg Leu Gly Val Lys Gly Val Leu 

2645 2650 2655 

Phe Gin Pro Cys Glu Arg Thr Ser Leu Cys Tyr Ala Pro Ser Trp Val 

2660 2665 2670 

Cys Asp Gly Ala Asn Asp Cys Gly Asp Tyr Ser Asp Glu Arg Asp Cys 

2675 2680 2685 

Pro Gly Val Lys Arg Pro Arg Cys Pro Leu Asn Tyr Phe Ala Cys Pro 

2690 2695 2700 

Ser Gly Arg Cys He Pro Met Ser Trp Thr Cys Asp Lys Glu Asp Asp 
705 2710 2715 2720 

Cys Glu His Gly Glu Asp Glu Thr His Cys Asn Lys Phe Cys Ser Glu 

2725 2730 ~ 2735 

Ala Gin Phe Glu Cys Gin Asn His Arg Cys He Ser Lys Gin Trp Leu 

2740 2745 2750 

Cys Asp Gly Ser Asp Asp Cys Gly Asp Gly Ser Asp Glu Ala Ala His 

2755 2760 2765 

Cys Glu Gly Lys Thr Cys Gly Pro Ser Ser Phe Ser Cys Pro Gly Thr 

2770 ' 2775 2780 

His Val Cys Val Pro Glu Arg Trp Leu Cys Asp Gly Asp Lys Asp Cys 
785 2790 2795 2800 

Ala Asp Gly Ala Asp Glu Ser He Ala Ala Gly Cys Leu Tyr Asn Ser 
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2805 



2810 



2815 



Thr Cys Asp Asp Arg Glu Phe Met Cys Gin Asn Arg Gin Cys He Pro 

2820 2825 2830 

Lys His Phe Val Cys Asp His Asp Arg Asp Cys Ala Asp Gly Ser Asp 

2835 2840 " " 2845 

Glu Ser Pro Glu Cys Glu Tyr Pro Thr Cys Gly Pro Ser Glu Phe Arg 

2850 2855 2860 

Cys Ala Asn Gly Arg Cys Leu Ser Ser Arg Gin Trp Glu Cys Asp Gly 
865 2870 2875 2880 

Glu Asn Asp Cys His Asp Gin Ser Asp Glu Ala Pro Lys Asn Pro His 

2885 2890 2895 

Cys Thr Ser Pro Glu His Lys Cys Asn Ala Ser Ser Gin Phe Leu Cys 

2900 2905 2910 

Ser Ser Gly Arg Cys Val Ala Glu Ala Leu Leu Cys Asn Gly Gin Asp 

2915 2920 2925 

Asp Cys Gly Asp Ser Ser Asp Glu Arg Gly Cys His He Asn Glu Cys 

2930 2935 ' 2940 

Leu Ser Arg Lys Leu Ser Gly Cys Ser Gin Asp Cys Glu Asp Leu Lys 
945 2950 ' ' 2955 2960 

He Gly Phe Lys Cys Arg Cys Arg Pro Gly Phe Arg Leu Lys Asp Asp 

2965 2970 2975 

Gly Arg Thr Cys Ala Asp Val Asp Glu Cys Ser Thr Thr Phe Pro Cys 

2980 2985 2990 

Ser Gin Arg Cys He Asn Thr His Gly Ser Tyr Lys Cys Leu Cys Val 

2995 3000 " 3005 

Glu Gly Tyr Ala Pro Arg Gly Gly Asp Pro His Ser Cys Lys Ala Val 

3010 3015 ' ' 3020 

Thr Asp Glu Glu Pro Phe Leu He Phe Ala Asn Arg Tyr Tyr Leu Arg 
025 3030 3035 3040 

Lys Leu Asn Leu Asp Gly Ser Asn Tyr Thr Leu Leu Lys Gin Gly Leu 

3045 3050 3055 

Asn Asn Ala Val Ala Leu Asp Phe Asp Tyr Arg Glu Gin Met He Tyr 

3060 3065 ' 3070 

Trp Thr Asp Val Thr Thr Gin Gly Ser Met He Arg Arg Met His Leu 

3075 3080 3085 

Asn Gly Ser Asn Val Gin Val Leu His Arg Thr Gly Leu Ser Asn Pro 

3090 3095 3100 

Asp Gly Leu Ala Val Asp Trp Val Gly Gly Asn Leu Tyr Trp Cys Asp 
105 3110 3115 3120 

Lys Gly Arg Asp Thr He Glu Val Ser Lys Leu Asn Gly Ala Tyr Arg 

3125 3130 3135 

Thr Val Leu Val Ser Ser Gly Leu Arg Glu Pro Arg Ala Leu Val Val 

3140 3145 3150 

Asp Val Gin Asn Gly Tyr Leu Tyr Trp Thr Asp Trp Gly Asp His Ser 

3155 " 3160 ' 3165 

Leu lie Gly Arg He Gly Met Asp Gly Ser Ser Arg Ser Val He Val 

3170 3175 3180 

Asp Thr Lys He Thr Trp Pro Asn Gly Leu Thr Leu Asp Tyr Val Thr 
185 3190 ' 3195 3200 

Glu Arg He Tyr Trp Ala Asp Ala Arg Glu Asp Tyr He Glu Phe Ala 

3205 3210 " 3215 

Ser Leu Asp Gly Ser Asn Arg His Val Val Leu Ser Gin Asp He Pro 

3220 " 3225 3230 

His He Phe Ala Leu Thr Leu Phe Glu Asp Tyr Val Tyr Txp Thr Asp 

3235 3240 3245 

Trp Glu Thr Lys Ser He Asn Arg Ala His Lys Thr Thr Gly Thr Asn 

3250 3255 3260 

Lys Thr Leu Leu He Ser Thr Leu His Arg Pro Met Asp Leu His Val 
265 3270 3275 3280 
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Phe His Ala Leu Arg Gin Pro Asp Val Pro Asn His Pro Cys Lys Val 

3285 3290 3295 

Asn Asn Gly Gly Cys Ser Asn Leu Cys Leu Leu Ser Pro Gly Gly Gly 

3300 3305 3310 

His Lys Cys Ala Cys Pro Thr Asn Phe Tyr Leu Gly Ser Asp Gly Arg 

3315 3320 3325 

Thr Cys Val Ser Asn Cys Thr Ala Ser Gin Phe Val Cys Lys Asn Asp 

3330 ' 3335 3340 

Lys Cys lie Pro Phe Trp Trp Lys Cys Asp Thr Glu Asp Asp Cys Gly 
345 3350 3355 3360 

Asp His Ser Asp Glu Pro Pro Asp Cys Pro Glu Phe Lys Cys Arg Pro 

3365 3370 " 3375 

Gly Gin Phe Gin Cys Ser Thr Gly He Cys Thr Asn Pro Ala Phe He 

3380 3385 3390 

Cys Asp Gly Asp Asn Asp Cys Gin Asp Asn Ser Asp Glu Ala Asn Cys 

3395 3400 3405 

Asp He His Val Cys Leu Pro Ser Gin Phe Lys Cys Thr Asn Thr Asn 

3410 3415 3420 

Arg Cys He Pro Gly He Phe Arg Cys Asn Gly Gin Asp Asn Cys Gly 
425 3430 3435 3440 

Asp Gly Glu Asp Glu Arg Asp Cys Pro Glu Val Thr Cys Ala Pro Asn 

3445 3450 " 3455 

Gin Phe Gin Cys Ser lie Thr Lys Arg Cys He Pro Arg Val Trp Val 

3460 3465 3470 

Cys Asp Arg Asp Asn Asp Cys Val Asp Gly Ser Asp Glu Pro Ala Asn 

3475 3480 3485 

Cys Thr Gin Met Thr Cys Gly Val Asp Glu Phe Arg Cys Lys Asp Ser 

3490 3495 " 3500 

Gly Arg Cys He Pro Ala Arg Trp Lys Cys Asp Gly Glu Asd Asp Cys 
505 " " 3510 3515 3520 

Gly Asp Gly Ser Asp Glu Pro Lys Glu Glu Cys Asp Glu Arg Thr Cys 

3525 3530 * 3535 

Glu Pro Tyr Gin Phe Arg Cys Lys Asn Asn Arg Cys Val Pro Gly Arg 

3540 3545 3550 

Trp Gin Cys Asp Tyr Asp Asn Asp Cys Gly Asp Asn Ser Asp Glu Glu 

3555 " 3560 " 3565 

Ser Cvs Thr Pro Arg Pro Cys Ser Glu Ser Glu Phe Ser Cys Ala Asn 

3570 3575 3580 

Gly Arg Cys He Ala Gly Arg Trp Lys Cys Asp Gly Asp His Asp Cys 
585 3590 3595 3600 

Ala Asp Gly Ser Asp Glu Lys Asp Cys Thr Pro Arg Cys Asp Met Asp 

3605 3610 3615 

Gin Phe Gin Cys Lys Ser Gly His Cys He Pro Leu Arg Trp Arg Cys 

3620 3625 3630 

Asp Ala Asp Ala Asp Cys Met Asp Gly Ser Asp Glu Glu Ala Cys Gly 

3635 ' " 3640 3645 

Thr Gly Val Arg Thr Cys Pro Leu Asp Glu Phe Gin Cys Asn Asn Thr 

3650 3655 3660 

Leu Cys Lys Pro Leu Ala Trp Lys Cys Asp Gly Glu Asp Asp Cys Gly 
665 " 3670 3675 3680 

Asp Asn Ser Asp Glu Asn Pro Glu Glu Cys Ala Arg Phe Val Cys Pro 

3685 3690 3695 

Pro Asn Arg Pro Phe Arg Cys Lys Asn Asp Arg Val Cys Leu Trp He 

3700 3705 3710 

Gly Arg Gin Cys Asp Gly Thr Asp Asn Cys Gly Asp Gly Thr Asp Glu 

3715 . " 3720 3725 

Glu Asp Cys Glu Pro Pro Thr Ala His Thr Thr His Cys Lys Asp Lys 

3730 3735 3740 

Lys Glu Phe Leu Cys Arg Asn Gin Arg Cys Leu Ser Ser Ser Leu Arg 
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745 



37 50 



3755 



3760 



Cys Asn Met Phe Asp Asp Cys Gly Asp Gly Ser Asp Glu Glu Asp Cys 

3765 3770 3775 

Ser He Asp Pro Lys Leu Thr Ser Cys Ala Thr Asn Ala Ser He Cys 

3780 3785 3790 

Gly Asp Glu Ala Arg Cys Val Arg Thr Glu Lys Ala Ala Tyr Cys Ala 

3795 3800 3805 

Cys Arg Ser Gly Phe His Thr Val Pro Gly Gin Pro Gly Cys Gin Asp 

3810 3815 3820 

He Asn Glu Cys Leu Arg Phe Gly Thr Cys Ser Gin Leu Cys Asn Asn 
825 3830 3835 3840 

Thr Lys Gly Gly His Leu Cys Ser Cys Ala Arg Asn Phe Met Lys Thr 

3845 3850 3855 

His Asn Thr Cys Lys Ala Glu Gly Ser Glu Tyr Gin Val Leu Tyr He 

3860 3865 " 3870 

Ala Asp Asp Asn Glu lie Arg Ser Leu Phe Pro Gly His Pro His Ser 

3875 3880 3885 

Ala Tyr Glu Gin Ala Phe Gin Gly Asp Glu Ser Val Arg He Asp Ala 

3890 3895 " 3900 

Met Asp Val His Val Lys Ala Gly Arg Val Tyr Trp Thr Asn Trp His 
905 3910 ' 3915 3920 

Thr Gly Thr He Ser Tyr Arg Ser Leu Pro Pro Ala Ala Pro Pro Thr 

3925 3930 3935 

Thr Ser Asn Arg His Arg Arg Gin He Asp Arg Gly Val Thr His Leu 

3940 3945 3950 

Asn He Ser Gly Leu Lys Met Pro Arg Gly He Ala He Asp Trp Val 

3955 3960 3965 

Ala Gly Asn Val Tyr Trp Thr Asp Ser Gly Arg Asp Val He Glu Val 

3970 3975 ' " 3980 

Ala Gin Met Lys Gly Glu Asn Arg Lys Thr Leu He Ser Gly Met He 
985 3990 3995 4000 

Asp Glu Pro His Ala lie Val Val Asp Pro Leu Arg Gly Thr Met Tyr 

4005 4010 " " 4015 

Trp Ser Asp Trp Gly Asn His Pro Lys He Glu Thr Ala Ala Met Asp 

4020 4025 4030 

Gly Thr Leu Arg Glu Thr Leu Val Gin Asp Asn He Gin Trp Pro Thr 

4035 4040 4045 

Gly Leu Ala Val Asp Tyr His Asn Glu Arg Leu Tyr Trp Ala Asp Ala 

4050 4055 4060 

Lys Leu Ser Val He Gly Ser He Arg Leu Asn Gly Thr Asp Pro He 
065 4070 4075 4080 

Val Ala Ala Asp Ser Lys Arg Gly Leu Ser His Pro Phe Ser He Asp 

4085 4090 4095 

Val Phe Glu Asp Tyr He Tyr Gly Val Thr Tyr lie Asn Asn Arg Val 

4100 4105 4110 

Phe Lys He His Lys Phe Gly His Ser Pro Leu Val Asn Leu Thr Gly 

4115 4120 4125 

Gly Leu Ser His Ala Ser Asp Val Val Leu Tyr His Gin His Lys Gin 

4130 4135 4140 

Pro Glu Val Thr Asn Pro Cys Asp Arg Lys Lys Cys Glu Trp Leu Cys 
145 4150 4155 4160 

Leu Leu Ser Pro Ser Gly Pro Val Cys Thr Cys Pro Asn Gly Lys Arg 

4165 4170 4175 

Leu Asp Asn Gly Thr Cys Val Pro Val Pro Ser Pro Thr Pro Pro Pro 

4180 4185 4190 

Asp Ala Pro Arg Pro Gly Thr Cys Asn Leu Gin Cys Phe Asn Gly Gly 

4195 " 4200 4205 

Ser Cys Phe Leu Asn Ala Arg Arg Gin Pro Lys Cys Arg Cys Gin Pro 
4210 4215 4220 
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Arg Tyr Thr Gly Asp Lys Cys Glu Leu Asp Gin Cys Trp Glu His Cys 
225 4230 4235 * 4240 

Arg Asn Gly Gly Thr Cys Ala Ala Ser Pro Ser Gly Met Pro Thr Cys 

4245 4250 4255 

Arg Cys Pro Thr Gly Phe Thr Gly Pro Lys Cys Thr Gin Gin Val Cys 

4260 4265 4270 

Ala Gly Tyr Cys Ala Asn Asn Ser Thr Cys Thr Val Asn Gin Gly Asn 

4275 4280 ' 4285 

Gin Pro Gin Cys Arg Cys Leu Pro Gly Phe Leu Gly Asp Arg Cys Gin 

4290 4295 4300 

Tyr Arg Gin Cys Ser Gly Tyr Cys Glu Asn Phe Gly Thr Cys Gin Met 
305 ' 4310 4315 ~ " 4320 

Ala Ala Asp Gly Ser Arg Gin Cys Arg Cys Thr Ala Tyr Phe Glu Gly 

4325 4330 ' 4335 

Ser Arg Cys Glu Val Asn Lys Cys Ser Arg Cys Leu Glu Gly Ala Cys 

4340 4345 4350 

Val Val Asn Lys Gin Ser Gly Asp Val Thr Cys Asn Cys Thr Asp Gly 

4355 4360 4365 

Arg Val Ala Pro Ser Cys Leu Thr Cys Val Gly His Cys Ser Asn Gly 

4370 4375 4380 

Gly Ser Cys Thr Met Asn Ser Lys Met Met Pro Glu Cys Gin Cys Pro 
385 4390 4395 4400 

Pro His Met Thr Gly Pro Arg Cys Glu Glu His Val Phe Ser Gin Gin 

4405 4410 4415 

Gin Pro Gly His lie Ala Ser lie Leu lie Pro Leu Leu Leu Leu Leu 

4420 4425 4430 

Leu Leu Val Leu Val Ala Gly Val Val Phe Trp Tyr Lys Arg Arg Val 

4435 4440 4445 

Gin Gly Ala Lys Gly Phe Gin His Gin Arg Met Thr Asn Gly Ala Met 

4450 * " 4455 ' 4460 

Asn Val Glu He Gly Asn Pro Thr Tyr Lys Met Tyr Glu Gly Gly Glu 
465 4470 4475 ~ 4480 

Pro Asp Asp Val Gly Gly Leu Leu Asp Ala Asp Phe Ala Leu Asp Pro 

4485 4490 4495 

Asp Lys Pro Thr Asn Phe Thr Asn Pro Val Tyr Ala Thr Leu Tyr Met 

4500 4505 4510 

Gly Gly His Gly Ser Arg His Ser Leu Ala Ser Thr Asp Glu Lys Arg 

4515 4520 4525 

Glu Leu Leu Gly Arg Gly Pro Glu Asp Glu He Gly Asp Pro Leu Ala 
4530 4535 " 4540 
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As a below named inventor, 1 hereby declare that: 

My residence, post office address and citizenship are as stated below at 201 et seq. underneath my name. 

I believe I am the original, first and sole inventor if only one name is listed at 201 below, or an original, first and joint inventor if plural names are 
listed at 201 et seq. below, of the subject matter which is claimed and for which a patent is sought on the invention entitled 

Alpha (2) Macroglobulin Receptor as a Heat Shock Protein Receptor and Uses Thereof 

and for which a patent application: 

b is attached hereto and includes amendment(s) filed on of applicable) 

□ was filed in the United States on as Application No. (for dedamnon not accompany app i,canon) 

with amendment(s) filed on (,/a PP i,cabie> 

o was filed as PCT international Application No. on and was amended under PCT Article 19 on (,/a PP i,cabie) 



>e material to patentability as defined in Title 37, Code of Federal Regulations, 



I acknowledge the duty to disclose information known to n 
gj.56. 

1 hereby claim foreign priority benefits under Title 35, United States Code, §119(a)-(d) of any foreign application(s) for patent or inventor's 
ggrtificate listed below and have also identified below any foreign application for patent or inventor's certificate having a filing date before that of 
the application on which priority is claimed: 



EARLIEST FOREIGN APPLICATION(S), IF ANY, FILED PRIOR TO THE FILING DATE OF THE APPLICATION 


APPLICATION NUMBER 


COUNTRY 


DATE OF FILING 
(day, month, year) 


PRIORITY 
CLAIMED 








YES o NO □ 








YES □ NO □ 



J. hereby claim the benefit under Title 35, United States Code, §1 19(e) of any United States provisional application(s) listed below. 



APPLICATION NUMBER 



FILING DATE 



I hereby claim the benefit under Title 35, United States Code, § 1 20 of any United States application(s) listed below and, insofar as the subject matter 
of each of the claims of this application is not disclosed in the prior United States application in the manner provided by the first paragraph of Title 
35 , United States Code §112,1 acknowledge the duty to disclose information which is material to patentability as defined in Title 37, Code of Federal 
Regulations, § 1 .56 which became available between the filing date of the prior application and the national or PCT international filing date of this 
application: 



APPLICATION SERIAL NO. 


FILING DATE 


STATUS 


PATENTED 


PENDING 


ABANDONED 


09/625,137 


July 25, 2000 



















POWER OF ATTORNEY: As a named inventor, I hereby appoint S. Leslie Misrock (Reg. No. 1 8872), Harry C. Jones, III (Reg. No. 20280), Berj A. Terzian (Reg. 
No. 20060), David Weild, III (Reg. No. 2 1094), Jonathan A. Marshall (Reg. No. 24614), Barry D. Rein (Reg. No. 2241 1), Stanton T. Lawrence, III (Reg. No 25736), 
Charles E. McKenney (Reg. No. 22795), Philip T. Shannon (Reg. No. 24278), Francis E. Morris (Reg. No. 24615), Charles E. Miller (Reg. No. 24576), Gidon D. 
Stern (Reg. No. 27469), John J. Lauter, Jr. (Reg. No. 27814), Brian M. Poissant (Reg. No. 28462), Brian D. Coggio (Reg. No. 27624), Rory J. Raddmg (Reg No. 
28749), Stephen J. Harbulak (Reg. No. 29166), Donald J. Goodell (Reg. No. 19766), James N. Palik (Reg. No. 25510), Thomas E. Friebel (Reg. No. 29258), Lairra 
A. Coruzzi (Reg. No. 30742), Jennifer Gordon (Reg. No. 30753), Allan A. Fanucci (Reg. No. 30256), Geraldine F. Baldwin (Reg. No. 31232), Victor N. Balancia 
(Reg. No. 31231), Samuel B. Abrams (Reg. No. 30605), Steven I. Wallach (Reg. No. 35402), Marcia H. Sundeen (Reg. No. 30893), Paul J. Zegger (Reg. No. 33821), 
Edmond R. Bannon (Reg. No. 321 10), Bruce J. Barker (Reg. No 33291), Adriane M Antler (Reg. No. 32605), Thomas G. Rowan (Reg. No. 34419), James G 
Markey (Reg. No. 3 1636), Thomas D. Kohler (Reg. No. 32797), Scott D. Stimpson (Reg. No. 33607), Gary S. Williams (Reg. No. 31066), William S. Galham (Reg. 
No. 33885), Ann L. Gisolfi (Reg. No. 31956), Todd A. Wagner (Reg. No. 35399), Scott B. Familant (Reg. No. 35514), Kelly D Talcott (Reg. No. 39582), Francis 
D. Cerrito (Reg.No.38100), Anthony M. Insogna (Reg. No. 35203), Brian M Rothery (Reg. No 35340), Brian D. Siff (Reg. No. 35679), and Alan Tenenbaum (Reg. 
No. 34939), all of Pennie & Edmonds LLP, whose addresses are 1 155 Avenue of the Americas, New York, New York 10036, 1667 K Street N.W , Washington, DC 
20006 and 3300 Hillview Avenue, Palo Alto, CA 94304, and each of them, my attorneys, to prosecute this application, and to transact all business m the Patent and 
Trademark Office connected therewith. 
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I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and belief are believed to be true, and further 
that these statements were made with the knowledge that willful false statements and the like so made are punishable by fine or imprisonment, or both, under Section 
1001 of Title 18 of the United States Code and that such willful false statements may jeopardize the validity of the application or any patent issuing thereon. 
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SEQUENCE LISTING 



<110> Pramod K. Srivastava 

<12 0> ALPHA (2) MACROGLOBULIN RECEPTOR AS A HEAT SHOCK 
PROTEIN RECEPTOR AND USES THEREOF 

<130> 8449-128 

<140> 
<141> 

<150> 09/625,137 
<151> 2000-07-25 

<150> 60/209,095 
<151> 2000-06-02 

<160> 57 

<170> FastSEQ for Windows Version 3.0 

<210> 1 

<211> 14849 

<212> DNA 

<213> Mus musculus 

<400> 1 

cgctgctccc cgccagtgca ctgaggaggc ggaaacgggg gagcccctag tgctccatca 60 

ggcccctacc aaggcacccc catcgggtcc acgcccccca ccccccaccc cgcctcctcc 120 

caattgtgca tttttgcagc cggagtcggc tccgagatgg ggctgtgagc ttcgccctgg 180 

gagggggaga ggagcgagga gtaaagcagg ggtgaagggt tcgaatttgg gggcaggggg 24 0 

cgcacccgcg tcagcaggcc cttcccaggg ggctcggaac tgtaccattt cacctatgcc 300 

cctggttcgc tttgcttaag gaaggataag atagaagagt cggggagagg aagataaagg 3 60 

gggacccccc aattgggggg ggcgaggaca agaagtaaca ggaccagagg gtgggggctg 42 0 

ctgtttgcat cggcccacac catgctgacc ccgccgttgc tgctgctcgt gccgctgctt 480 

tcagctctgg tctccggggc cactatggat gcccctaaaa cttgcagccc taagcagttt 540 

gcctgcagag accaaatcac ctgtatctca aagggctggc ggtgtgacgg tgaaagagat 600 

tgccccgacg gctctgatga agcccctgag atctgtccac agagtaaagc ccagagatgc 660 

ccgccaaatg agcacagttg tctggggact gagctatgtg tccccatgtc tcgtctctgc 72 0 

aacgggatcc aggactgcat ggatggctca gacgagggtg ctcactgccg agagctccga 780 

gccaactgtt ctcgaatggg ttgtcaacac cattgtgtac ctacacccag tgggcccacg 840 

tgctactgta acagcagctt ccagctcgag gcagatggca agacgtgcaa agattttgac 900 

gagtgttccg tgtatggcac ctgcagccag ctttgcacca acacagatgg ctccttcaca 960 

tgtggctgtg ttgaaggcta cctgctgcaa ccggacaacc gctcctgcaa ggccaagaat 102 0 

gagccagtag atcggccgcc agtgctactg attgccaact ctcagaacat cctagctacg 1080 

tacctgagtg gggcccaagt gtctaccatc acacccacca gcacccgaca aaccacggcc 1140 

atggacttca gttatgccaa tgagaccgta tgctgggtgc acgttgggga cagtgctgcc 12 0 0 

cagacacagc tcaagtgtgc ccggatgcct ggcctgaagg gctttgtgga tgagcatacc 1260 

atcaacatct ccctcagcct gcaccacgtg gagcagatgg caatcgactg gctgacggga 1320 

aacttctact ttgtcgacga cattgacgac aggatctttg tctgtaaccg aaacggggac 1380 

acctgtgtca ctctgctgga cctggaactc tacaacccca aaggcatcgc cttggacccc 1440 

gccatgggga aggtgttctt cactgactac gggcagatcc caaaggtgga gcgctgtgac 1500 

atggatggac agaaccgcac caagctggtg gatagcaaga tcgtgtttcc acacggcatc 1560 

accctggacc tggtcagccg cctcgtctac tgggcggacg cctacctaga ctacatcgag 1620 

gtggtagact acgaagggaa gggtcggcag accatcatcc aaggcatcct gatcgagcac 1680 

ctgtacggcc tgaccgtgtt tgagaactat ctctacgcca ccaactcgga caatgccaac 1740 



1 



acgcagcaga agacgagcgt gatccgagtg aaccggttca acagtactga gtaccaggtc 1800 

gtcacccgtg tggacaaggg tggtgccctg catatctacc accagcgacg ccagccccga 1860 

gtgcggagtc acgcctgtga gaatgaccag tacgggaagc caggtggctg ctccgacatc 192 0 

tgcctcctgg ccaacagtca caaggcaagg acctgcaggt gcaggtctgg cttcagcctg 1980 

ggaagtgatg ggaagtcttg taagaaacct gaacatgagc tgttcctcgt gtatggcaag 2 04 0 

ggccgaccag gcatcattag aggcatggac atgggggcca aggtcccaga tgagcacatg 210 0 

atccccatcg agaaccttat gaatccacgc gctctggact tccacgccga gaccggcttc 2160 

atctactttg ctgacaccac cagctacctc attggccgcc agaaaattga tggcacggag 2220 

agagagacta tcctgaagga tggcatccac aatgtggagg gcgtagccgt ggactggatg 2280 

ggagacaatc tttactggac tgatgatggc cccaagaaga ccattagtgt ggccaggctg 2 340 

gagaaagccg ctcagacccg gaagactcta attgagggca agatgacaca ccccagggcc 2400 

attgtagtgg atccactcaa tgggtggatg tactggacag actgggagga ggaccccaag 2460 

gacagtcggc gagggcggct cgagagggct tggatggacg gctcacaccg agatatcttt 2 52 0 

gtcacctcca agacagtgct ttggcccaat gggctaagcc tggatatccc agccggacgc 2580 

ctctactggg tggatgcctt ctatgaccga attgagacca tactgctcaa tggcacagac 2640 

cggaagattg tatatgaggg tcctgaactg aatcatgcct tcggcctgtg tcaccatggc 2700 

aactacctct tttggaccga gtaccggagc ggcagcgtct accgcttgga acggggcgtg 2760 

gcaggcgcac cgcccactgt gacccttctg cgcagcgaga gaccgcctat ctttgagatc 2820 

cgaatgtacg acgcgcacga gcagcaagtg ggtaccaaca aatgccgggt aaataacgga 2 880 

ggctgcagca gcctgtgcct cgccaccccc gggagccgcc agtgtgcctg tgccgaggac 2 940 

caggtgttgg acacagatgg tgtcacctgc ttggcgaacc catcctacgt gcccccaccc 3 000 

cagtgccagc cgggccagtt tgcctgtgcc aacaaccgct gcatccagga gcgctggaag 3060 

tgtgacggag acaacgactg tctggacaac agcgatgagg ccccagcact gtgccatcaa 3120 

cacacctgtc cctcggaccg attcaagtgt gagaacaacc ggtgtatccc caaccgctgg 3180 

ctctgtgatg gggataatga ttgtggcaac agcgaggacg aatccaatgc cacgtgctca 3240 

gcccgcacct gtccacccaa ccagttctcc tgtgccagtg gccgatgcat tcctatctca 3300 

tggacctgtg atctggatga tgactgtggg gaccggtccg atgagtcagc ctcatgcgcc 33 60 

taccccacct gcttccccct gactcaattt acctgcaaca atggcagatg tattaacatc 3420 

aactggcggt gtgacaacga caatgactgt ggggacaaca gcgacgaagc cggctgcagt 34 8 0 

cactcctgct ccagtaccca gttcaagtgc aacagtggca gatgcatccc cgagcactgg 3540 

acgtgtgatg gggacaatga ttgtggggac tacagcgacg agacacacgc caactgtacc 3600 

aaccaggcta caagacctcc tggtggctgc cactcggatg agttccagtg cccgctagat 3660 

ggcctgtgca tccccctgag gtggcgctgc gacggggaca ccgactgcat ggattccagc 3720 

gatgagaaga gctgtgaggg cgtgacccat gtttgtgacc cgaatgtcaa gtttggctgc 3780 

aaggactccg cccggtgcat cagcaaggcg tgggtgtgtg atggcgacag cgactgtgaa 3 84 0 

gataactccg acgaggagaa ctgtgaggcc ctggcctgca ggccaccctc ccatccctgc 3900 

gccaacaaca cctctgtctg cctgcctcct gacaagctgt gcgacggcaa ggatgactgt 3960 

ggagacggct cggatgaggg cgagctctgt gaccagtgtt ctctgaataa tggtggctgt 4020 

agtcacaact gctcagtggc ccctggtgaa ggcatcgtgt gctcttgccc tctgggcatg 40 8 0 

gagctgggct ctgacaacca cacctgccag atccagagct actgtgccaa gcacctcaaa 4140 

tgcagccaga agtgtgacca gaacaagttc agtgtgaagt gctcctgcta cgagggctgg 42 0 0 

gtcttggagc ctgacgggga aacgtgccgc agtctggatc ccttcaaact gttcatcatc 4260 

ttctccaacc gccacgagat caggcgcatt gaccttcaca agggggacta cagcgtccta 4320 

gtgcctggcc tgcgcaacac tattgccctg gacttccacc tcagccagag tgccctctac 4380 

tggaccgacg cggtagagga caagatctac cgtgggaaac tcctggacaa cggagccctg 4440 

accagctttg aggtggtgat tcagtatggc ttggccacac cagagggcct ggctgtagat 4500 

tggattgcag gcaacatcta ctgggtggag agcaacctgg accagatcga agtggccaag 456 0 

ctggacggaa ccctccgaac cactctgctg gcgggtgaca ttgagcaccc gagggccatc 462 0 

gctctggacc ctcgggatgg gattctgttt tggacagact gggatgccag cctgccacga 4680 

atcgaggctg catccatgag tggagctggc cgccgaacca tccaccggga gacaggctct 474 0 

gggggctgcg ccaatgggct caccgtggat tacctggaga agcgcatcct ctggattgat 4800 

gctaggtcag atgccatcta ttcagcccgg tatgacggct ccggccacat ggaggtgctt 4860 

cggggacacg agttcctgtc acacccattt gccgtgacac tgtacggtgg ggaggtgtac 4 92 0 

tggaccgact ggcgaacaaa tacactggct aaggccaaca agtggactgg ccacaacgtc 4 98 0 

accgtggtac agaggaccaa cacccagccc ttcgacctgc aggtgtatca cccttcccgg 5040 

cagcccatgg ctccaaaccc atgtgaggcc aatggcggcc ggggcccctg ttcccatctg 5100 

tgcctcatca actacaaccg gaccgtctcc tgggcctgtc cccacctcat gaagctgcac 5160 

aaggacaaca ccacctgcta tgagtttaag aagttcctgc tgtacgcacg tcagatggag 522 0 
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atccggggcg tggacctgga tgccccgtac tacaattata tcatctcctt cacggtgcct 5280 

gatatcgaca atgtcacggt gctggactat gatgcccgag agcagcgagt ttactggtct 534 0 

gatgtgcgga ctcaagccat caaaagggca tttatcaacg gcactggcgt ggagaccgtt 5400 

gtctctgcag acttgcccaa cgcccacggg ctggctgtgg actgggtctc ccgaaatctg 5460 

ttttggacaa gttacgacac caacaagaag cagattaacg tggcccggct ggacggctcc 5 52 0 

ttcaagaatg cggtggtgca gggcctggag cagccccacg gcctggtcgt ccacccgctt 5580 

cgtggcaagc tctactggac tgatggggac aacatcagca tggccaacat ggatgggagc 5 64 0 

aaccacactc tgctcttcag tggccagaag ggccctgtgg ggttggccat tgacttccct 5700 

gagagcaaac tctactggat cagctctggg aaccacacaa tcaaccgttg caatctggat 5760 

gggagcgagc tggaggtcat cgacaccatg cggagccagc tgggcaaggc cactgccctg 5 82 0 

gccatcatgg gggacaagct gtggtgggca gatcaggtgt cagagaagat gggcacgtgc 5880 

aacaaagccg atggctctgg gtccgtggtg ctgcggaaca gtaccacgtt ggttatgcac 5940 

atgaaggtgt atgacgagag catccagcta gagcatgagg gcaccaaccc ctgcagtgtc 6000 

aacaacggag actgttccca gctctgcctg ccaacatcag agacgactcg ctcctgtatg 6060 

tgtacagccg gttacagcct ccggagcgga cagcaggcct gtgagggtgt gggctctttt 6120 

ctcctgtact ctgtacatga gggaattcgg gggattccac tagatcccaa tgacaagtcg 6180 

gatgccctgg tcccagtgtc cggaacttca ctggctgtcg gaatcgactt ccatgccgaa 6240 

aatgacacta tttattgggt ggatatgggc ctaagcacca tcagcagggc caagcgtgac 6300 

cagacatggc gagaggatgt ggtgaccaac ggtattggcc gtgtggaggg catcgccgtg 63 60 

gactggatcg caggcaacat atactggacg gaccagggct tcgatgtcat cgaggttgcc 642 0 

cggctcaatg gctcttttcg ttatgtggtc atttcccagg gtctggacaa gcctcgggcc 6480 

atcactgtcc acccagagaa ggggtacttg ttctggaccg agtggggtca ttacccacgt 6540 

attgagcggt ctcgccttga tggcacagag agagtggtgt tggttaatgt cagcatcagc 6600 

tggcccaatg gcatctcagt agactatcag ggcggcaagc tctactggtg tgatgctcgg 6660 

atggacaaga tcgagcgcat cgacctggaa acgggcgaga accgggaggt ggtcctgtcc 672 0 

agcaataaca tggatatgtt ctccgtgtcc gtgtttgagg acttcatcta ctggagtgac 6780 

agaactcacg ccaatggctc catcaagcgc ggctgcaaag acaatgctac agactccgtg 6840 

cctctgagga caggcattgg tgttcagctt aaagacatca aggtcttcaa cagggacagg 6900 

cagaagggta ccaatgtgtg cgcggtagcc aacggcgggt gccagcagct ctgcttgtat 6960 

cgsggtggcg gacagcgagc ctgtgcctgt gcccacggga tgctggcaga agacggggcc 7020 

tcatgccgag agtacgctgg ctacctgctc tactcagagc ggaccatcct caagagcatc 7080 

cacctgtcgg atgagcgtaa cctcaacgca ccggtgcagc cctttgaaga ccccgagcac 714 0 

atgaaaaatg tcatcgccct ggcctttgac taccgagcag gcacctcccc ggggacccct 7200 

aaccgcatct tcttcagtga catccacttt gggaacatcc agcagatcaa tgacgatggc 7260 

tcgggcagga ccaccatcgt ggaaaatgtg ggctctgtgg aaggcctggc ctatcaccgt 73 2 0 

ggctgggaca cactgtactg gacaagctac accacatcca ccatcacccg ccacaccgtg 73 80 

gaccagactc gcccaggggc cttcgagagg gagacagtca tcaccatgtc cggagacgac 7440 

cacccgagag cctttgtgct ggatgagtgc cagaacctga tgttctggac caattggaac 75 0 0 

gagctccatc caagcatcat gcgggcagcc ctatccggag ccaacgtcct gaccctcatt 7560 

gagaaggaca tccgcacgcc caatgggttg gccatcgacc accgggcgga gaagctgtac 762 0 

ttctcggatg ccaccttgga caagatcgag cgctgcgagt acgacggctc ccaccgctat 76 8 0 

gtgatcctaa agtcggagcc cgtccacccc tttgggttgg cggtgtacgg agagcacatt 774 0 

ttctggactg actgggtgcg gcgggctgtg cagcgagcca acaagtatgt gggcagcgac 7800 

atgaagctgc ttcgggtgga cattccccag caacccatgg gcatcatcgc cgtggccaat 7860 

gacaccaaca gctgtgaact ctccccctgc cgtatcaaca atggaggctg ccaggatctg 792 0 

tgtctgctca cccaccaagg ccacgtcaac tgttcctgtc gagggggccg gatcctccag 7980 

gaggacttca cctgccgggc tgtgaactcc tcttgtcggg cacaagatga gtttgagtgt 8 04 0 

gccaatgggg aatgtatcag cttcagcctc acctgtgatg gcgtctccca ctgcaaggac 8100 

aagtccgatg agaagccctc ctactgcaac tcacgccgct gcaagaagac tttccgccag 8160 

tgtaacaatg gccgctgtgt atccaacatg ctgtggtgca atggggtgga ttactgtggg 822 0 

gatggctctg atgagatacc ttgcaacaag actgcctgtg gtgtgggtga gttccgctgc 8280 

cgggatgggt cctgcatcgg gaactccagt cgctgcaacc agtttgtgga ttgtgaggat 8340 

gcctcggatg agatgaattg cagtgccaca gactgcagca gctatttccg cctgggcgtg 840 0 

aaaggtgtcc tcttccagcc gtgcgagcgg acatccctgt gctacgcacc tagctgggtg 8460 

tgtgatggcg ccaacgactg tggagactac agcgatgaac gtgactgtcc aggtgtgaag 8 52 0 

cgccctaggt gcccgctcaa ttactttgcc tgccccagcg ggcgctgtat ccccatgagc 8580 

tggacgtgtg acaaggagga tgactgtgag aacggcgagg atgagaccca ctgcaacaag 8 64 0 

ttctgctcag aggcacagtt cgagtgccag aaccaccggt gtatctccaa gcagtggctg 8700 
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tgtgacggta gcgatgattg cggggatggc tccgatgagg cagctcactg tgaaggcaag 8760 

acatgtggcc cctcctcctt ctcctgtccc ggcacccacg tgtgtgtccc tgagcgctgg 8820 

ctctgtgatg gcgacaagga ctgtaccgat ggcgcggatg agagtgtcac tgctggctgc 8880 

ctgtacaaca gcacctgtga tgaccgtgag ttcatgtgcc agaaccgctt gtgtattccc 8940 

aagcatttcg tgtgcgacca tgaccgtgac tgtgctgatg gctctgatga atcccctgag 9000 

tgtgagtacc caacctgcgg gcccaatgaa ttccgctgtg ccaatgggcg ttgtctgagc 9060 

tcccgtcagt gggaatgtga tggggagaat gactgtcacg accacagcga tgaggctccc 9120 

aagaacccac actgcaccag cccagagcac aaatgcaatg cctcatcaca gttcctgtgc 9180 

agcagcgggc gctgcgtggc tgaggcgttg ctctgcaacg gccaggacga ctgtggggac 9240 

ggttcagacg aacgcgggtg ccatgtcaac gagtgtctca gccgcaagct cagtggctgc 93 00 

agtcaggact gcgaggacct caagataggc tttaagtgcc gctgtcgccc gggcttccgg 9360 

ctaaaggacg atggcaggac ctgtgccgac ctggatgagt gcagcaccac cttcccctgc 9420 

agccagctct gcatcaacac ccacggaagt tacaagtgtc tgtgtgtgga gggctatgca 9480 

ccccgtggcg gtgaccccca cagctgcaaa gctgtgaccg atgaggagcc atttctcatc 9540 

tttgccaacc ggtactacct gcggaagctc aacctggacg gctccaacta cacactgctt 9600 

aagcagggcc tgaacaatgc ggtcgccttg gcatttgact accgagagca gatgatctac 9660 

tggacgggcg tgaccaccca gggcagcatg attcgcagga tgcacctcaa cggcagcaac 9720 

gtgcaggttc tgcaccggac gggccttagt aacccagatg ggctcgctgt ggactgggtg 9780 

ggtggcaacc tgtactggtg tgacaagggc agagatacca ttgaggtgtc caagcttaac 9840 

ggggcctatc ggacagtgct ggtcagctct ggcctccggg agcccagagc tctggtagtg 9900 

gatgtacaga atgggtacct gtactggaca gactggggtg accactcact gatcggccgg 99S0 

attggcatgg atggatctgg ccgcagcatc atcgtggaca ctaagatcac atggcccaat 10020 

ggcctgaccg tggactacgt cacggaacgc atctactggg ctgacgcccg tgaggactac 100 80 

atcgagttcg ccagcctgga tggctccaac cgtcacgttg tgctgagcca agacatccca 10140 

cacatctttg cgctgaccct atttgaagac tacgtctact ggacagactg ggaaacgaag 10200 

tccatcaacc gggcccacaa gaccacgggt gccaacaaaa cactcctcat cagcaccctg 10260 

caccggccca tggacttaca tgtattccac gccctgcgcc agccagatgt gcccaatcac 10320 

ccctgcaaag tcaacaatgg tggctgcagc aacctgtgcc tgctgtcccc tgggggtggt 103 80 

cacaagtgcg cctgccccac caacttctat ctgggtggcg atggccgtac ctgtgtgtcc 10440 

aactgcacag caagccagtt tgtgtgcaaa aatgacaagt gcatcccctt ctggtggaag 10500 

tgtgacacgg aggacgactg tggggatcac tcagacgagc ctccagactg tcccgagttc 10560 

aagtgccgcc caggccagtt ccagtgctcc accggcatct gcaccaaccc tgccttcatc 10620 

tgtgatgggg acaatgactg ccaagacaat agtgacgagg ccaattgcga cattcacgtc 10680 

tgcttgccca gccaattcaa gtgcaccaac accaaccgct gcattcctgg catcttccgt 10740 

tgcaatgggc aggacaactg cggggacggc gaggatgagc gggattgccc tgaggtgacc 10800 

tgcgccccca accagttcca gtgctccatc accaagcgct gcatccctcg cgtctgggtc 10860 

tgtgacaggg ataatcactg tgtggacggc agtgatgagc ctgccaactg tacccaaatg 10920 

acctgtggag tggatgagtt ccgctgcaag gattctggcc gctgcatccc cgcgcgctgg 10980 

aagtgtgacg gagaagatga ctgtggggat ggttcagatg agcccaagga agagtgtgat 1104 0 

gagcgcacct gtgagccata ccagttccgc tgcaaaaaca accgctgtgt cccaggccgt 11100 

tggcaatgtg actacgacaa cgactgcgga gataactcgg acgaggagag ctgcacacct 11160 

cggccctgct ctgagagtga gtttttctgt gccaatggcc gctgcatcgc tgggcgctgg 1122 0 

aagtgtgatg gggaccatga ctgtgccgac ggctcagacg agaaagactg caccccccgc 11280 

tgtgatatgg accagttcca gtgcaagagt ggccactgca tccccctgcg ctggccgtgt 11340 

gacgcggatg ctgactgtat ggacggcagt gacgaggaag cctgtggcac tggggtgagg 11400 

acctgcccat tggatgagtt tcaatgtaac aacaccttgt gcaagccgct ggcctggaag 11460 

tgtgatggag aggacgactg tggggacaac tcagatgaga accccgagga atgcgcccgg 1152 0 

ttcatctgcc ctcccaaccg gcctttccgc tgcaagaatg accgagtctg cctgtggatt 11580 

gggcgccagt gtgatggcgt ggacaactgt ggagatggga ctgacgagga ggactgtgag 1164 0 

ccccccacgg cccagaaccc ccactgcaaa gacaagaagg agttcctgtg ccgaaaccag 11700 

cgctgtctat catcctccct gcgctgtaac atgttcgatg actgcggcga tggctccgat 11760 

gaagaagatt gcagcatcga ccccaagctg accagctgtg ccaccaatgc cagcatgtgt 1182 0 

ggggacgaag ctcgttgtgt gcgcactgag aaagctgcct actgtgcctg ccgctcgggc 1188 0 

ttccatactg tgccgggcca gcccggatgc caggacatca acgagtgcct gcgctttggt 11940 

acctgctctc agctctggaa caaacccaag ggaggccacc tctgcagctg tgcccgcaac 12000 

ttcatgaaga cacacaacac ctgcaaagct gaaggctccg agtaccaggt gctatacatc 12060 

gcggatgaca acgagatccg cagcttgttc ccgggccacc cccactcagc ctacgagcag 1212 0 

acattccagg gcgatgagag tgtccgcata gatgccatgg atgtccatgt caaggccggc 1218 0 
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cgtgtctact ggactaactg gcacacgggc acaatctcct acaggagcct gccccctgcc 12240 

gcccctccta ccacttccaa ccgccaccgg aggcagatcg accggggtgt cacccacctc 12300 

aatatttcag ggctgaagat gccgaggggt atcgctatcg actgggtggc cgggaatgtg 12 3 60 

tactggaccg attccggccg agacgtgatt gaggtggcgc aaatgaaggg cgagaaccgc 12420 

aagacgctca tctcgggcat gattgatgag ccccatgcca tcgtggtgga ccctctgagg 12480 

ggcaccatgt actggtcaga ctgggggaac caccccaaga ttgaaacagc agcgatggat 12540 

ggcacccttc gggagactct cgtgcaagac aacattcagt ggcctacagg gctggctgtg 12600 

gactatcaca atgaacggct ctactgggca gatgccaagc tttcggtcat cggcagcatc 12660 

cggctcaacg gcactgaccc cattgtggct gctgacagca aacgaggcct aagtcacccc 12720 

ttcagcatcg atgtgtttga agactacatc tacggagtca cttacatcaa taatcgtgtc 12780 

ttcaagatcc acaagtttgg acacagcccc ttgtacaacc taactggggg cctgagccat 12 840 

gcctctgatg tagtccttta ccatcaacac aagcagcctg aagtgaccaa cccctgtgac 12900 

cgcaagaaat gcgaatggct gtgtctgctg agccccagcg ggcctgtctg cacctgtccc 12960 

aatggaaaga ggctggataa tggcacctgt gtgcctgtgc cctctccaac accccctcca 13020 

gatgccccta ggcctggaac ctgcactctg cagtgcttca atggtggtag ttgtttcctc 13080 

aacgctcgga ggcagcccaa gtgccgttgc cagccccgtt acacaggcga taagtgtgag 13140 

ctggatcagt gctgggaata ctgtcacaac ggaggcacct gtgcggcttc cccatctggc 132 00 

atgcccacgt gccgctgtcc cactggcttc acgggcccca aatgcaccgc acaggtgtgt 13260 

gcaggctact gctctaacaa cagcacctgc accgtcaacc agggcaacca gccccagtgc 13320 

cgatgtctac ctggcttcct gggcgaccgt tgccagtacc ggcagtgctc tggcttctgt 13380 

gagaactttg gcacctgtca gatggctgct gatggctccc gacaatgtcg ctgcaccgtc 13440 

tactttgagg gaccaaggtg tgaggtgaac aagtgtagtc gctgtctcca aggcgcctgt 13500 

gtggtcaata agcagaccgg agatgtcaca tgcaactgca ctgatggccg ggtagccccc 13560 

agttgtctca cctgcatcga tcactgtagc aatggtggct cctgcaccat gaacagcaag 13620 

atgatgcctg agtgccagtg cccgccccat atgacaggac cccggtgcca ggagcaggtt 13 680 

gttagtcagc aacagcctgg gcatatggcc tccatcctga tccctctgct gctgcttctc 13740 

ctgctgcttc tggtggctgg cgtggtgttc tggtataagc ggcgagtccg aggggctaag 13800 

ggcttccagc accagcggat gaccaatggg gccatgaatg tggaaattgg aaaccctacc 13 8 60 

tacaagatgt atgaaggtgg agagcccgat gatgtcgggg gcctactgga tgctgatttt 13 92 0 

gcccttgacc ctgacaagcc taccaacttc accaacccag tgtatgccac gctctacatg 13980 

gggggccacg gcagccgcca ttccctggcc agcacggacg agaagcgaga actgctgggc 14040 

cggggacctg aagacgagat aggagatccc ttggcatagg gccctgcccc gacggatgtc 14100 

cccagaaagc cccctgccac atgagtcttt caatgaaccc cctccccagc cggcccttct 14160 

ccggccctgc cgggtgtaca aatgtaaaaa tgaaggaatt actttttata tgtgagcgag 142 2 0 

caagcgagca agcacagtat tatctctttg catttccttc ctgcctgctc ctcagtatcc 142 80 

cccccatgct gccttgaggg ggcggggagg gctttgtggc tcaaaggtat gaaggagtcc 14340 

acatgttccc taccgagcat acccctggaa gcctggcggc acggcctccc caccacgcct 14400 

gtgcaagaca ctcaacgggg ctccgtgtcc cagctttcct ttccttggct ctctggggtt 14460 

agttcagggg aggtggagtc ctctgctgac cctgtctgga agatttggct ctagctgagg 14520 

aaggagtctt ttagttgagg gaagtcaccc caaaccccag ctcccacttt caggggcacc 14580 

tctcagatgg ccatgctcag tatcccttcc agacaggccc tcccctctct agcgccccct 14640 

ctgtggctcc tagggctgaa cacattcttt ggtaactgtc ccccaagcct cccatccccc 14700 

tgagggccag gaagagtcgg ggcacaccaa ggaagggcaa gcgggcagcc ccattttggg 14760 

gacgtgaacg ttttaataat ttttgctgaa ttcctttaca actaaataac acagatattg 14820 

ttataaataa aattgtaaaa aaaaaaaaa 14849 
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<211> 4545 

<212> PRT 

<213> Mus musculus 

<400> 2 

Met Leu Thr Pro Pro Leu Leu Leu Leu Val Pro Leu Leu Ser Ala Leu 

15 10 15 

Val Ser Gly Ala Thr Met Asp Ala Pro Lys Thr Cys Ser Pro Lys Gin 
20 25 30 
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Phe 


Ala 


Cys 


Arg 


Asp 


Gin 


He 


Thr 


Cys 


He 


Ser 


Lys 


Gly 


Trp 


Arg 


Cys 






35 










40 










45 








Asp 


Gly 


Glu 


Arg 


Asp 


Cys 


Pro 


Asp 


Gly 


Ser 


Asp 


Glu 


Ala 


Pro 


Glu 


He 




50 










55 










60 










Cys 


Pro 


Gin 


Ser 


Lys 


Ala 


Gin 


Arg 


Cys 


Pro 


Pro 


Asn 


Glu 


His 


Ser 


Cys 


65 










70 










75 










80 


Leu 


Gly 


Thr 


Glu 


Leu 


Cys 


Val 


Pro 


Met 


Ser 


Arg 


Leu 


Cys 


Asn 


Gly 


He 










85 










90 










95 




Gin 


Asp 


Cys 


Met 


Asp 


Gly 


Ser 


Asp 


Glu 


Gly 


Ala 


His 


Cys 


Arg 


Glu 


Leu 








100 










105 










110 






Arg 


Ala 


Asn 


Cys 


Ser 


Arg 


Met 


Gly 


Cys 


Gin 


His 


His 


Cys 


Val 


Pro 


Thr 






115 










120 










125 








Pro 


Ser 


Gly 


Pro 


Thr 


Cys 


Tyr 


Cys 


Asn 


Ser 


Ser 


Phe 


Gin 


Leu 


Glu 


Ala 




130 










135 










14 0 










Asp 


Gly 


Lys 


Thr 


Cys 


Lys 


Asp 


Phe 


Asp 


Glu 


Cys 


Ser 


Val 


Tyr 


Gly 


Thr 


145 










150 










155 










160 


Cys 


Ser 


Gin 


Leu 


Cys 


Thr 


Asn 


Thr 


Asp 


Gly 


Ser 


Phe 


Thr 


Cys 


Gly 


Cys 










165 










170 










175 




Val 


Glu 


Gly 


Tyr 


Leu 


Leu 


Gin 


Pro 


Asp 


Asn 


Arg 


Ser 


Cys 


Lys 


Ala 


Lys 








180 










185 










190 






Asn 


Glu 


Pro 


Val 


Asp 


Arg 


Pro 


Pro 


Val 


Leu 


Leu 


He 


Ala 


Asn 


Ser 


Gin 






195 










200 










205 








Asn 


He 


Leu 


Ala 


Thr 


Tyr 


Leu 


Ser 


Gly 


Ala 


Gin 


Val 


Ser 


Thr 


He 


Thr 




210 










215 










220 










Pro 


Thr 


Ser 


Thr 


Arg 


Gin 


Thr 


Thr 


Ala 


Met 


Asp 


Phe 


Ser 


Tyr 


Ala 


Asn 


225 










230 










235 










240 


Glu 


Thr 


Val 


Cys 


Trp 


Val 


His 


Val 


Gly 


Asp 


Ser 


Ala 


Ala 


Gin 


Thr 


Gin 










245 










250 










255 




Leu 


Lys 


Cys 


Ala 


Arg 


Met 


Pro 


Gly 


Leu 


Lys 


Gly 


Phe 


Val 


Asp 


Glu 


His 








260 










265 










270 






Thr 


He 


Asn 


He 


Ser 


Leu 


Ser 


Leu 


His 


His 


Val 


Glu 


Gin 


Met 


Ala 


He 






275 










280 










285 








Asp 


Trp 


Leu 


Thr 


Gly 


Asn 


Phe 


Tyr 


Phe 


Val 


Asp 


Asp 


He 


Asp 


Asp 


Arg 




290 










295 










300 










He 


Phe 


Val 


Cys 


Asn 


Arg 


Asn 


Gly 


Asp 


Thr 


Cys 


Val 


Thr 


Leu 


Leu 


Asp 


305 










310 










315 










320 


Leu 


Glu 


Leu 


Tyr 


Asn 


Pro 


Lys 


Gly 


He 


Ala 


Leu 


Asp 


Pro 


Ala 


Met 


Gly 










325 










330 










335 




Lys 


Val 


Phe 


Phe 


Thr 


Asp 


Tyr 


Gly 


Gin 


He 


Pro 


Lys 


Val 


Glu 


Arg 


Cys 








340 










345 










350 






Asp 


Met 


Asp 


Gly 


Gin 


Asn 


Arg 


Thr 


Lys 


Leu 


Val 


Asp 


Ser 


Lys 


He 


Val 






355 










360 










365 








Phe 


Pro 


His 


Gly 


He 


Thr 


Leu 


Asp 


Leu 


Val 


Ser 


Arg 


Leu 


Val 


Tyr 


Trp 




370 










375 










380 










Ala 


Asp 


Ala 


Tyr 


Leu 


Asp 


Tyr 


He 


Glu 


Val 


Val 


Asp 


Tyr 


Glu 


Gly 


Lys 


385 










390 










395 










400 


Gly 


Arg 


Gin 


Thr 


He 


He 


Gin 


Gly 


He 


Leu 


He 


Glu 


His 


Leu 


Tyr 


Gly 










405 










410 










415 




Leu 


Thr 


Val 


Phe 


Glu 


Asn 


Tyr 


Leu 


Tyr 


Ala 


Thr 


Asn 


Ser 


Asp 


Asn 


Ala 








420 










425 










430 






Asn 


Thr 


Gin 


Gin 


Lys 


Thr 


Ser 


Val 


He 


Arg 


Val 


Asn 


Arg 


Phe 


Asn 


Ser 






435 










440 










445 








Thr 


Glu 


Tyr 


Gin 


Val 


Val 


Thr 


Arg 


Val 


Asp 


Lys 


Gly 


Gly 


Ala 


Leu 


His 




450 










455 










460 










He 


Tyr 


His 


Gin 


Arg 


Arg 


Gin 


Pro 


Arg 


Val 


Arg 


Ser 


His 


Ala 


Cys 


Glu 


465 










470 










475 










480 


Asn 


Asp 


Gin 


Tyr 


Gly 


Lys 


Pro 


Gly 


Gly 


Cys 


Ser 


Asp 


He 


Cys 


Leu 


Leu 










485 










490 










495 
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Ala 


Asn 


Ser 


His 


Lys 


Ala 


Arg 


Thr 


Cys 


Arg 


Cys 


Arg 


Ser 


Gly 


Phe 


Ser 








500 










505 










510 






Leu 


Gly 


Ser 


Asp 


Gly 


Lys 


Ser 


Cys 


Lys 


Lys 


Pro 


Glu 


His 


Glu 


Leu 


Phe 






515 










520 










525 








Leu 


Val 


Tyr 


Gly 


Lys 


Gly 


Arg 


Pro 


Gly 


He 


He 


Arg 


Gly 


Met 


Asp 


Met 




530 










535 










540 










Gly 


Ala 


Lys 


Val 


Pro 


Asp 


Glu 


His 


Met 


He 


Pro 


He 


Glu 


Asn 


Leu 


Met 


545 










550 










555 










560 


Asn 


Pro 


Arg 


Ala 


Leu 


Asp 


Phe 


His 


Ala 


Glu 


Thr 


Gly 


Phe 


He 


Tyr 


Phe 










565 










570 










575 




Ala 


Asp 


Thr 


Thr 


Ser 


Tyr 


Leu 


He 


Gly 


Arg 


Gin 


Lys 


He 


Asp 


Gly 


Thr 








580 










585 










590 






Glu 


Arg 


Glu 


Thr 


He 


Leu 


Lys 


Asp 


Gly 


He 


His 


Asn 


Val 


Glu 


Gly 


Val 






595 










600 










605 








Ala 


Val 


Asp 


Trp 


Met 


Gly 


Asp 


Asn 


Leu 


Tyr 


Trp 


Thr 


Asp 


Asp 


Gly 


Pro 




610 










615 










620 










Lys 


Lys 


Thr 


He 


Ser 


Val 


Ala 


Arg 


Leu 


Glu 


Lys 


Ala 


Ala 


Gin 


Thr 


Arg 


625 










630 










635 










640 


Lys 


Thr 


Leu 


He 


Glu 


Gly 


Lys 


Met 


Thr 


His 


Pro 


Arg 


Ala 


He 


Val 


Val 










645 










650 










655 




Asp 


Pro 


Leu 


Asn 


Gly 


Trp 


Met 


Tyr 


Trp 


Thr 


Asp 


Trp 


Glu 


Glu 


Asp 


Pro 








660 










665 










670 






Lys 


Asp 


Ser 


Arg 


Arg 


Gly 


Arg 


Leu 


Glu 


Arg 


Ala 


Trp 


Met 


Asp 


Gly 


Ser 






675 










680 










685 








His 


Arg 


Asp 


He 


Phe 


Val 


Thr 


Ser 


Lys 


Thr 


Val 


Leu 


Trp 


Pro 


Asn 


Gly 




690 










695 










700 










Leu 


Ser 


Leu 


Asp 


He 


Pro 


Ala 


Gly 


Arg 


Leu 


Tyr 


Trp 


Val 


Asp 


Ala 


Phe 


705 










710 










715 










720 


Tyr 


Asp 


Arg 


He 


Glu 


Thr 


He 


Leu 


Leu 


Asn 


Gly 


Thr 


Asp 


Arg 


Lys 


He 










725 










730 










735 




Val 


Tyr 


Glu 


Gly 


Pro 


Glu 


Leu 


Asn 


His 


Ala 


Phe 


Gly 


Leu 


Cys 


His 


His 








740 










745 










750 






Gly 


Asn 


Tyr 


Leu 


Phe 


Trp 


Thr 


Glu 


Tyr 


Arg 


Ser 


Gly 


Ser 


Val 


Tyr 


Arg 






755 










760 










765 








Leu 


Glu 


Arg 


Gly 


Val 


Ala 


Gly 


Ala 


Pro 


Pro 


Thr 


Val 


Thr 


Leu 


Leu 


Arg 




770 










775 










780 










Ser 


Glu 


Arg 


Pro 


Pro 


He 


Phe 


Glu 


He 


Arg 


Met 


Tyr 


Asp 


Ala 


His 


Glu 


785 










790 










795 










800 


Gin 


Gin 


Val 


Gly 


Thr 


Asn 


Lys 


Cys 


Arg 


Val 


Asn 


Asn 


Gly 


Gly 


Cys 


Ser 










805 










810 










815 




Ser 


Leu 


Cys 


Leu 


Ala 


Thr 


Pro 


Gly 


Ser 


Arg 


Gin 


Cys 


Ala 


Cys 


Ala 


Glu 








820 










825 










830 






Asp 


Gin 


Val 


Leu 


Asp 


Thr 


Asp 


Gly 


Val 


Thr 


Cys 


Leu 


Ala 


Asn 


Pro 


Ser 






835 










840 










845 








Tyr 


Val 


Pro 


Pro 


Pro 


Gin 


Cys 


Gin 


Pro 


Gly 


Gin 


Phe 


Ala 


Cys 


Ala 


Asn 




850 










855 










860 










Asn 


Arg 


Cys 


He 


Gin 


Glu 


Arg 


Trp 


Lys 


Cys 


Asp 


Gly 


Asp 


Asn 


Asp 


Cys 


865 










870 










875 










880 


Leu 


Asp 


Asn 


Ser 


Asp 


Glu 


Ala 


Pro 


Ala 


Leu 


Cys 


His 


Gin 


His 


Thr 


Cys 










885 










890 










895 




Pro 


Ser 


Asp 


Arg 


Phe 


Lys 


Cys 


Glu 


Asn 


Asn 


Arg 


Cys 


He 


Pro 


Asn 


Arg 








900 










905 










910 






Trp 


Leu 


Cys 


Asp 


Gly 


Asp 


Asn 


Asp 


Cys 


Gly 


Asn 


Ser 


Glu 


Asp 


Glu 


Ser 






915 










920 










925 








Asn 


Ala 


Thr 


Cys 


Ser 


Ala 


Arg 


Thr 


Cys 


Pro 


Pro 


Asn 


Gin 


Phe 


Ser 


Cys 




930 










935 










940 










Ala 


Ser 


Gly 


Arg 


Cys 


He 


Pro 


He 


Ser 


Trp 


Thr 


Cys 


Asp 


Leu 


Asp 


Asp 


945 










950 
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955 










960 



Asp Cys Gly Asp Arg Ser Asp Glu Ser Ala Ser Cys Ala Tyr Pro Thr 

965 970 975 

Cys Phe Pro Leu Thr Gin Phe Thr Cys Asn Asn Gly Arg Cys lie Asn 

980 985 990 

lie Asn Trp Arg Cys Asp Asn Asp Asn Asp Cys Gly Asp Asn Ser Asp 

995 1000 1005 

Glu Ala Gly Cys Ser His Ser Cys Ser Ser Thr Gin Phe Lys Cys Asn 

1010 1015 1020 

Ser Gly Arg Cys lie Pro Glu His Trp Thr Cys Asp Gly Asp Asn Asp 
1025 1030 1035 1040 

Cys Gly Asp Tyr Ser Asp Glu Thr His Ala Asn Cys Thr Asn Gin Ala 

1045 1050 1055 

Thr Arg Pro Pro Gly Gly Cys His Ser Asp Glu Phe Gin Cys Pro Leu 

1060 1065 1070 

Asp Gly Leu Cys lie Pro Leu Arg Trp Arg Cys Asp Gly Asp Thr Asp 

1075 1080 1085 

Cys Met Asp Ser Ser Asp Glu Lys Ser Cys Glu Gly Val Thr His Val 

1090 1095 1100 

Cys Asp Pro Asn Val Lys Phe Gly Cys Lys Asp Ser Ala Arg Cys lie 
1105 1110 1115 1120 

Ser Lys Ala Trp Val Cys Asp Gly Asp Ser Asp Cys Glu Asp Asn Ser 

1125 1130 1135 

Asp Glu Glu Asn Cys Glu Ala Leu Ala Cys Arg Pro Pro Ser His Pro 

1140 1145 1150 

Cys Ala Asn Asn Thr Ser Val Cys Leu Pro Pro Asp Lys Leu Cys Asp 

1155 1160 1165 

Gly Lys Asp Asp Cys Gly Asp Gly Ser Asp Glu Gly Glu Leu Cys Asp 

1170 1175 1180 

Gin Cys Ser Leu Asn Asn Gly Gly Cys Ser His Asn Cys Ser Val Ala 
1185 1190 1195 1200 

Pro Gly Glu Gly lie Val Cys Ser Cys Pro Leu Gly Met Glu Leu Gly 

1205 1210 1215 

Ser Asp Asn His Thr Cys Gin lie Gin Ser Tyr Cys Ala Lys His Leu 

1220 1225 1230 

Lys Cys Ser Gin Lys Cys Asp Gin Asn Lys Phe Ser Val Lys Cys Ser 

1235 1240 1245 

Cys Tyr Glu Gly Trp Val Leu Glu Pro Asp Gly Glu Thr Cys Arg Ser 

1250 1255 1260 

Leu Asp Pro Phe Lys Leu Phe lie lie Phe Ser Asn Arg His Glu lie 
1265 1270 1275 1280 

Arg Arg lie Asp Leu His Lys Gly Asp Tyr Ser Val Leu Val Pro Gly 

1285 1290 1295 

Leu Arg Asn Thr lie Ala Leu Asp Phe His Leu Ser Gin Ser Ala Leu 

1300 1305 1310 

Tyr Trp Thr Asp Ala Val Glu Asp Lys lie Tyr Arg Gly Lys Leu Leu 

1315 1320 1325 

Asp Asn Gly Ala Leu Thr Ser Phe Glu Val Val lie Gin Tyr Gly Leu 

1330 1335 1340 

Ala Thr Pro Glu Gly Leu Ala Val Asp Trp lie Ala Gly Asn lie Tyr 
1345 1350 1355 1360 

Trp Val Glu Ser Asn Leu Asp Gin lie Glu Val Ala Lys Leu Asp Gly 

1365 1370 1375 

Thr Leu Arg Thr Thr Leu Leu Ala Gly Asp lie Glu His Pro Arg Ala 

1380 1385 1390 

lie Ala Leu Asp Pro Arg Asp Gly lie Leu Phe Trp Thr Asp Trp Asp 

1395 1400 1405 

Ala Ser Leu Pro Arg lie Glu Ala Ala Ser Met Ser Gly Ala Gly Arg 
1410 1415 1420 



Arg Thr lie His Arg Glu Thr Gly Ser Gly Gly Cys Ala Asn Gly Leu 
1425 1430 1435 1440 

Thr Val Asp Tyr Leu Glu Lys Arg lie Leu Trp lie Asp Ala Arg Ser 

1445 1450 1455 

Asp Ala lie Tyr Ser Ala Arg Tyr Asp Gly Ser Gly His Met Glu Val 

1460 1465 1470 

Leu Arg Gly His Glu Phe Leu Ser His Pro Phe Ala Val Thr Leu Tyr 

1475 1480 1485 

Gly Gly Glu Val Tyr Trp Thr Asp Trp Arg Thr Asn Thr Leu Ala Lys 

1490 1495 1500 

Ala Asn Lys Trp Thr Gly His Asn Val Thr Val Val Gin Arg Thr Asn 
1505 1510 1515 1520 

Thr Gin Pro Phe Asp Leu Gin Val Tyr His Pro Ser Arg Gin Pro Met 

1525 1530 1535 

Ala Pro Asn Pro Cys Glu Ala Asn Gly Gly Arg Gly Pro Cys Ser His 

1540 1545 1550 

Leu Cys Leu lie Asn Tyr Asn Arg Thr Val Ser Trp Ala Cys Pro His 

1555 1560 1565 

Leu Met Lys Leu His Lys Asp Asn Thr Thr Cys Tyr Glu Phe Lys Lys 

1570 1575 1580 

Phe Leu Leu Tyr Ala Arg Gin Met Glu lie Arg Gly Val Asp Leu Asp 
1585 1590 1595 1600 

Ala Pro Tyr Tyr Asn Tyr lie lie Ser Phe Thr Val Pro Asp lie Asp 

1605 1610 1615 

Asn Val Thr Val Leu Asp Tyr Asp Ala Arg Glu Gin Arg Val Tyr Trp 

1620 1625 1630 

Ser Asp Val Arg Thr Gin Ala lie Lys Arg Ala Phe lie Asn Gly Thr 

1635 1640 1645 

Gly Val Glu Thr Val Val Ser Ala Asp Leu Pro Asn Ala His Gly Leu 

1650 1655 1660 

Ala Val Asp Trp Val Ser Arg Asn Leu Phe Trp Thr Ser Tyr Asp Thr 
1665 1670 1675 1680 

Asn Lys Lys Gin lie Asn Val Ala Arg Leu Asp Gly Ser Phe Lys Asn 

1685 1690 1695 

Ala Val Val Gin Gly Leu Glu Gin Pro His Gly Leu Val Val His Pro 

1700 1705 1710 

Leu Arg Gly Lys Leu Tyr Trp Thr Asp Gly Asp Asn lie Ser Met Ala 

1715 1720 1725 

Asn Met Asp Gly Ser Asn His Thr Leu Leu Phe Ser Gly Gin Lys Gly 

1730 1735 1740 

Pro Val Gly Leu Ala lie Asp Phe Pro Glu Ser Lys Leu Tyr Trp lie 
1745 1750 1755 1760 

Ser Ser Gly Asn His Thr lie Asn Arg Cys Asn Leu Asp Gly Ser Glu 

1765 1770 1775 

Leu Glu Val lie Asp Thr Met Arg Ser Gin Leu Gly Lys Ala Thr Ala 

1780 1785 1790 

Leu Ala lie Met Gly Asp Lys Leu Trp Trp Ala Asp Gin Val Ser Glu 

1795 1800 1805 

Lys Met Gly Thr Cys Asn Lys Ala Asp Gly Ser Gly Ser Val Val Leu 

1810 1815 1820 

Arg Asn Ser Thr Thr Leu Val Met His Met Lys Val Tyr Asp Glu Ser 
1825 1830 1835 1840 

lie Gin Leu Glu His Glu Gly Thr Asn Pro Cys Ser Val Asn Asn Gly 

1845 1850 1855 

Asp Cys Ser Gin Leu Cys Leu Pro Thr Ser Glu Thr Thr Arg Ser Cys 

1860 1865 1870 

Met Cys Thr Ala Gly Tyr Ser Leu Arg Ser Gly Gin Gin Ala Cys Glu 
1875 1880 1885 
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Gly Val Gly Ser Phe Leu Leu Tyr Ser Val His Glu Gly lie Arg Gly 

1890 1895 1900 

lie Pro Leu Asp Pro Asn Asp Lys Ser Asp Ala Leu Val Pro Val Ser 
1905 1910 1915 1920 

Gly Thr Ser Leu Ala Val Gly lie Asp Phe His Ala Glu Asn Asp Thr 

1925 1930 1935 

lie Tyr Trp Val Asp Met Gly Leu Ser Thr lie Ser Arg Ala Lys Arg 

1940 1945 1950 

Asp Gin Thr Trp Arg Glu Asp Val Val Thr Asn Gly lie Gly Arg Val 

1955 1960 1965 

Glu Gly He Ala Val Asp Trp He Ala Gly Asn He Tyr Trp Thr Asp 

1970 1975 1980 

Gin Gly Phe Asp Val He Glu Val Ala Arg Leu Asn Gly Ser Phe Arg 
1985 1990 1995 2000 

Tyr Val Val He Ser Gin Gly Leu Asp Lys Pro Arg Ala He Thr Val 

2005 2010 2015 

His Pro Glu Lys Gly Tyr Leu Phe Trp Thr Glu Trp Gly His Tyr Pro 

2020 2025 2030 

Arg He Glu Arg Ser Arg Leu Asp Gly Thr Glu Arg Val Val Leu Val 

2035 2040 2045 

Asn Val Ser He Ser Trp Pro Asn Gly He Ser Val Asp Tyr Gin Gly 

2050 2055 2060 

Gly Lys Leu Tyr Trp Cys Asp Ala Arg Met Asp Lys He Glu Arg He 
2065 2070 2075 2080 

Asp Leu Glu Thr Gly Glu Asn Arg Glu Val Val Leu Ser Ser Asn Asn 

2085 2090 2095 

Met Asp Met Phe Ser Val Ser Val Phe Glu Asp Phe He Tyr Trp Ser 

2100 2105 2110 

Asp Arg Thr His Ala Asn Gly Ser He Lys Arg Gly Cys Lys Asp Asn 

2115 2120 2125 

Ala Thr Asp Ser Val Pro Leu Arg Thr Gly He Gly Val Gin Leu Lys 

2130 2135 2140 

Asp He Lys Val Phe Asn Arg Asp Arg Gin Lys Gly Thr Asn Val Cys 
2145 2150 2155 2160 

Ala Val Ala Asn Gly Gly Cys Gin Gin Leu Cys Leu Tyr Arg Gly Gly 

2165 2170 2175 

Gly Gin Arg Ala Cys Ala Cys Ala His Gly Met Leu Ala Glu Asp Gly 

2180 2185 2190 

Ala Ser Cys Arg Glu Tyr Ala Gly Tyr Leu Leu Tyr Ser Glu Arg Thr 

2195 2200 2205 

He Leu Lys Ser He His Leu Ser Asp Glu Arg Asn Leu Asn Ala Pro 

2210 2215 2220 

Val Gin Pro Phe Glu Asp Pro Glu His Met Lys Asn Val He Ala Leu 
2225 2230 2235 2240 

Ala Phe Asp Tyr Arg Ala Gly Thr Ser Pro Gly Thr Pro Asn Arg He 

2245 2250 2255 

Phe Phe Ser Asp He His Phe Gly Asn He Gin Gin He Asn Asp Asp 

2260 2265 2270 

Gly Ser Gly Arg Thr Thr He Val Glu Asn Val Gly Ser Val Glu Gly 

2275 2280 2285 

Leu Ala Tyr His Arg Gly Trp Asp Thr Leu Tyr Trp Thr Ser Tyr Thr 

2290 2295 2300 

Thr Ser Thr He Thr Arg His Thr Val Asp Gin Thr Arg Pro Gly Ala 
2305 2310 2315 2320 

Phe Glu Arg Glu Thr Val He Thr Met Ser Gly Asp Asp His Pro Arg 

2325 2330 2335 

Ala Phe Val Leu Asp Glu Cys Gin Asn Leu Met Phe Trp Thr Asn Trp 
2340 2345 2350 
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Asn Glu Leu His Pro Ser lie Met Arg Ala Ala Leu Ser Gly Ala Asn 

2355 2360 2365 

Val Leu Thr Leu lie Glu Lys Asp lie Arg Thr Pro Asn Gly Leu Ala 

2370 2375 2380 

lie Asp His Arg Ala Glu Lys Leu Tyr Phe Ser Asp Ala Thr Leu Asp 
2385 2390 2395 2400 

Lys lie Glu Arg Cys Glu Tyr Asp Gly Ser His Arg Tyr Val lie Leu 

2405 2410 2415 

Lys Ser Glu Pro Val His Pro Phe Gly Leu Ala Val Tyr Gly Glu His 

2420 2425 2430 

lie Phe Trp Thr Asp Trp Val Arg Arg Ala Val Gin Arg Ala Asn Lys 

2435 2440 2445 

Tyr Val Gly Ser Asp Met Lys Leu Leu Arg Val Asp lie Pro Gin Gin 

2450 2455 2460 

Pro Met Gly lie lie Ala Val Ala Asn Asp Thr Asn Ser Cys Glu Leu 
2465 2470 2475 2480 

Ser Pro Cys Arg lie Asn Asn Gly Gly Cys Gin Asp Leu Cys Leu Leu 

2485 2490 2495 

Thr His Gin Gly His Val Asn Cys Ser Cys Arg Gly Gly Arg lie Leu 

2500 2505 2510 

Gin Glu Asp Phe Thr Cys Arg Ala Val Asn Ser Ser Cys Arg Ala Gin 

2515 2520 2525 

Asp Glu Phe Glu Cys Ala Asn Gly Glu Cys lie Ser Phe Ser Leu Thr 

2530 2535 2540 

Cys Asp Gly Val Ser His Cys Lys Asp Lys Ser Asp Glu Lys Pro Ser 
2545 2550 2555 2560 

Tyr Cys Asn Ser Arg Arg Cys Lys Lys Thr Phe Arg Gin Cys Asn Asn 

2565 2570 2575 

Gly Arg Cys Val Ser Asn Met Leu Trp Cys Asn Gly Val Asp Tyr Cys 

2580 2585 2590 

Gly Asp Gly Ser Asp Glu lie Pro Cys Asn Lys Thr Ala Cys Gly Val 

2595 2600 2605 

Gly Glu Phe Arg Cys Arg Asp Gly Ser Cys lie Gly Asn Ser Ser Arg 

2610 2615 2620 

Cys Asn Gin Phe Val Asp Cys Glu Asp Ala Ser Asp Glu Met Asn Cys 
2625 2630 2635 2640 

Ser Ala Thr Asp Cys Ser Ser Tyr Phe Arg Leu Gly Val Lys Gly Val 

2645 2650 2655 

Leu Phe Gin Pro Cys Glu Arg Thr Ser Leu Cys Tyr Ala Pro Ser Trp 

2660 2665 2670 

Val Cys Asp Gly Ala Asn Asp Cys Gly Asp Tyr Ser Asp Glu Arg Asp 

2675 2680 2685 

Cys Pro Gly Val Lys Arg Pro Arg Cys Pro Leu Asn Tyr Phe Ala Cys 

2690 2695 2700 

Pro Ser Gly Arg Cys lie Pro Met Ser Trp Thr Cys Asp Lys Glu Asp 
2705 2710 2715 2720 

Asp Cys Glu Asn Gly Glu Asp Glu Thr His Cys Asn Lys Phe Cys Ser 

2725 2730 2735 

Glu Ala Gin Phe Glu Cys Gin Asn His Arg Cys lie Ser Lys Gin Trp 

2740 2745 2750 

Leu Cys Asp Gly Ser Asp Asp Cys Gly Asp Gly Ser Asp Glu Ala Ala 

2755 2760 2765 

His Cys Glu Gly Lys Thr Cys Gly Pro Ser Ser Phe Ser Cys Pro Gly 

2770 2775 2780 

Thr His Val Cys Val Pro Glu Arg Trp Leu Cys Asp Gly Asp Lys Asp 
2785 2790 2795 2800 

Cys Thr Asp Gly Ala Asp Glu Ser Val Thr Ala Gly Cys Leu Tyr Asn 
2805 2810 2815 
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Ser Thr Cys Asp Asp Arg Glu Phe Met Cys Gin Asn Arg Leu Cys lie 

2820 2825 2830 

Pro Lys His Phe Val Cys Asp His Asp Arg Asp Cys Ala Asp Gly Ser 

2835 2840 2845 

Asp Glu Ser Pro Glu Cys Glu Tyr Pro Thr Cys Gly Pro Asn Glu Phe 

2850 2855 2860 

Arg Cys Ala Asn Gly Arg Cys Leu Ser Ser Arg Gin Trp Glu Cys Asp 
2865 2870 2875 2880 

Gly Glu Asn Asp Cys His Asp His Ser Asp Glu Ala Pro Lys Asn Pro 

2885 2890 2895 

His Cys Thr Ser Pro Glu His Lys Cys Asn Ala Ser Ser Gin Phe Leu 

2900 2905 2910 

Cys Ser Ser Gly Arg Cys Val Ala Glu Ala Leu Leu Cys Asn Gly Gin 

2915 2920 2925 

Asp Asp Cys Gly Asp Gly Ser Asp Glu Arg Gly Cys His Val Asn Glu 

2930 2935 2940 

Cys Leu Ser Arg Lys Leu Ser Gly Cys Ser Gin Asp Cys Glu Asp Leu 
2945 2950 2955 2960 

Lys lie Gly Phe Lys Cys Arg Cys Arg Pro Gly Phe Arg Leu Lys Asp 

2965 2970 2975 

Asp Gly Arg Thr Cys Ala Asp Leu Asp Glu Cys Ser Thr Thr Phe Pro 

2980 2985 2990 

Cys Ser Gin Leu Cys lie Asn Thr His Gly Ser Tyr Lys Cys Leu Cys 

2995 3000 3005 

Val Glu Gly Tyr Ala Pro Arg Gly Gly Asp Pro His Ser Cys Lys Ala 

3010 3015 3020 

Val Thr Asp Glu Glu Pro Phe Leu lie Phe Ala Asn Arg Tyr Tyr Leu 
3025 3030 3035 3040 

Arg Lys Leu Asn Leu Asp Gly Ser Asn Tyr Thr Leu Leu Lys Gin Gly 

3045 3050 3055 

Leu Asn Asn Ala Val Ala Leu Ala Phe Asp Tyr Arg Glu Gin Met lie 

3060 3065 3070 

Tyr Trp Thr Gly Val Thr Thr Gin Gly Ser Met lie Arg Arg Met His 

3075 3080 3085 

Leu Asn Gly Ser Asn Val Gin Val Leu His Arg Thr Gly Leu Ser Asn 

3090 3095 3100 

Pro Asp Gly Leu Ala Val Asp Trp Val Gly Gly Asn Leu Tyr Trp Cys 
3105 3110 3115 3120 

Asp Lys Gly Arg Asp Thr lie Glu Val Ser Lys Leu Asn Gly Ala Tyr 

3125 3130 3135 

Arg Thr Val Leu Val Ser Ser Gly Leu Arg Glu Pro Arg Ala Leu Val 

3140 3145 3150 

Val Asp Val Gin Asn Gly Tyr Leu Tyr Trp Thr Asp Trp Gly Asp His 

3155 3160 3165 

Ser Leu lie Gly Arg lie Gly Met Asp Gly Ser Gly Arg Ser lie lie 

3170 3175 3180 

Val Asp Thr Lys lie Thr Trp Pro Asn Gly Leu Thr Val Asp Tyr Val 
3185 3190 3195 3200 

Thr Glu Arg lie Tyr Trp Ala Asp Ala Arg Glu Asp Tyr lie Glu Phe 

3205 3210 3215 

Ala Ser Leu Asp Gly Ser Asn Arg His Val Val Leu Ser Gin Asp lie 

3220 3225 3230 

Pro His lie Phe Ala Leu Thr Leu Phe Glu Asp Tyr Val Tyr Trp Thr 

3235 3240 3245 

Asp Trp Glu Thr Lys Ser lie Asn Arg Ala His Lys Thr Thr Gly Ala 

3250 3255 3260 

Asn Lys Thr Leu Leu lie Ser Thr Leu His Arg Pro Met Asp Leu His 
3265 3270 3275 3280 
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Val Phe His Ala Leu Arg Gin Pro Asp Val Pro Asn His Pro Cys Lys 

3285 3290 3295 

Val Asn Asn Gly Gly Cys Ser Asn Leu Cys Leu Leu Ser Pro Gly Gly 

3300 3305 3310 

Gly His Lys Cys Ala Cys Pro Thr Asn Phe Tyr Leu Gly Gly Asp Gly 

3315 3320 3325 

Arg Thr Cys Val Ser Asn Cys Thr Ala Ser Gin Phe Val Cys Lys Asn 

3330 3335 3340 

Asp Lys Cys He Pro Phe Trp Trp Lys Cys Asp Thr Glu Asp Asp Cys 
3345 3350 3355 3360 

Gly Asp His Ser Asp Glu Pro Pro Asp Cys Pro Glu Phe Lys Cys Arg 

3365 3370 3375 

Pro Gly Gin Phe Gin Cys Ser Thr Gly He Cys Thr Asn Pro Ala Phe 

3380 3385 3390 

He Cys Asp Gly Asp Asn Asp Cys Gin Asp Asn Ser Asp Glu Ala Asn 

3395 3400 3405 

Cys Asp He His Val Cys Leu Pro Ser Gin Phe Lys Cys Thr Asn Thr 

3410 3415 3420 

Asn Arg Cys He Pro Gly He Phe Arg Cys Asn Gly Gin Asp Asn Cys 
3425 3430 3435 3440 

Gly Asp Gly Glu Asp Glu Arg Asp Cys Pro Glu Val Thr Cys Ala Pro 

3445 3450 3455 

Asn Gin Phe Gin Cys Ser He Thr Lys Arg Cys He Pro Arg Val Trp 

3460 3465 3470 

Val Cys Asp Arg Asp Asn His Cys Val Asp Gly Ser Asp Glu Pro Ala 

3475 3480 3485 

Asn Cys Thr Gin Met Thr Cys Gly Val Asp Glu Phe Arg Cys Lys Asp 

3490 3495 3500 

Ser Gly Arg Cys He Pro Ala Arg Trp Lys Cys Asp Gly Glu Asp Asp 
3505 3510 3515 3520 

Cys Gly Asp Gly Ser Asp Glu Pro Lys Glu Glu Cys Asp Glu Arg Thr 

3525 3530 3535 

Cys Glu Pro Tyr Gin Phe Arg Cys Lys Asn Asn Arg Cys Val Pro Gly 

3540 3545 3550 

Arg Trp Gin Cys Asp Tyr Asp Asn Asp Cys Gly Asp Asn Ser Asp Glu 

3555 3560 3565 

Glu Ser Cys Thr Pro Arg Pro Cys Ser Glu Ser Glu Phe Phe Cys Ala 

3570 3575 3580 

Asn Gly Arg Cys He Ala Gly Arg Trp Lys Cys Asp Gly Asp His Asp 
3585 3590 3595 3600 

Cys Ala Asp Gly Ser Asp Glu Lys Asp Cys Thr Pro Arg Cys Asp Met 

3605 3610 3615 

Asp Gin Phe Gin Cys Lys Ser Gly His Cys He Pro Leu Arg Trp Pro 

3620 3625 3630 

Cys Asp Ala Asp Ala Asp Cys Met Asp Gly Ser Asp Glu Glu Ala Cys 

3635 3640 3645 

Gly Thr Gly Val Arg Thr Cys Pro Leu Asp Glu Phe Gin Cys Asn Asn 

3650 3655 3660 

Thr Leu Cys Lys Pro Leu Ala Trp Lys Cys Asp Gly Glu Asp Asp Cys 
3665 3670 3675 3680 

Gly Asp Asn Ser Asp Glu Asn Pro Glu Glu Cys Ala Arg Phe He Cys 

3685 3690 3695 

Pro Pro Asn Arg Pro Phe Arg Cys Lys Asn Asp Arg Val Cys Leu Trp 

3700 3705 3710 

He Gly Arg Gin Cys Asp Gly Val Asp Asn Cys Gly Asp Gly Thr Asp 

3715 3720 3725 

Glu Glu Asp Cys Glu Pro Pro Thr Ala Gin Asn Pro His Cys Lys Asp 
3730 3735 3740 
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Lys Lys Glu Phe Leu Cys Arg Asn Gin Arg Cys Leu Ser Ser Ser Leu 
3745 3750 3755 3760 

Arg Cys Asn Met Phe Asp Asp Cys Gly Asp Gly Ser Asp Glu Glu Asp 

3765 3770 3775 

Cys Ser lie Asp Pro Lys Leu Thr Ser Cys Ala Thr Asn Ala Ser Met 

3780 3785 3790 

Cys Gly Asp Glu Ala Arg Cys Val Arg Thr Glu Lys Ala Ala Tyr Cys 

3795 3800 3805 

Ala Cys Arg Ser Gly Phe His Thr Val Pro Gly Gin Pro Gly Cys Gin 

3810 3815 3820 

Asp lie Asn Glu Cys Leu Arg Phe Gly Thr Cys Ser Gin Leu Trp Asn 
3825 3830 3835 3840 

Lys Pro Lys Gly Gly His Leu Cys Ser Cys Ala Arg Asn Phe Met Lys 

3845 3850 3855 

Thr His Asn Thr Cys Lys Ala Glu Gly Ser Glu Tyr Gin Val Leu Tyr 

3860 3865 3870 

lie Ala Asp Asp Asn Glu lie Arg Ser Leu Phe Pro Gly His Pro His 

3875 3880 3885 

Ser Ala Tyr Glu Gin Thr Phe Gin Gly Asp Glu Ser Val Arg lie Asp 

3890 3895 3900 

Ala Met Asp Val His Val Lys Ala Gly Arg Val Tyr Trp Thr Asn Trp 
3905 3910 3915 3920 

His Thr Gly Thr lie Ser Tyr Arg Ser Leu Pro Pro Ala Ala Pro Pro 

3925 3930 3935 

Thr Thr Ser Asn Arg His Arg Arg Gin lie Asp Arg Gly Val Thr His 

3940 3945 3950 

Leu Asn lie Ser Gly Leu Lys Met Pro Arg Gly lie Ala lie Asp Trp 

3955 3960 3965 

Val Ala Gly Asn Val Tyr Trp Thr Asp Ser Gly Arg Asp Val lie Glu 

3970 3975 3980 

Val Ala Gin Met Lys Gly Glu Asn Arg Lys Thr Leu lie Ser Gly Met 
3985 3990 3995 4000 

lie Asp Glu Pro His Ala lie Val Val Asp Pro Leu Arg Gly Thr Met 

4005 4010 4015 

Tyr Trp Ser Asp Trp Gly Asn His Pro Lys lie Glu Thr Ala Ala Met 

4020 4025 4030 

Asp Gly Thr Leu Arg Glu Thr Leu Val Gin Asp Asn lie Gin Trp Pro 

4035 4040 4045 

Thr Gly Leu Ala Val Asp Tyr His Asn Glu Arg Leu Tyr Trp Ala Asp 

4050 4055 4060 

Ala Lys Leu Ser Val lie Gly Ser lie Arg Leu Asn Gly Thr Asp Pro 
4065 4070 4075 4080 

lie Val Ala Ala Asp Ser Lys Arg Gly Leu Ser His Pro Phe Ser lie 

4085 4090 4095 

Asp Val Phe Glu Asp Tyr lie Tyr Gly Val Thr Tyr lie Asn Asn Arg 

4100 4105 4110 

Val Phe Lys lie His Lys Phe Gly His Ser Pro Leu Tyr Asn Leu Thr 

4115 4120 4125 

Gly Gly Leu Ser His Ala Ser Asp Val Val Leu Tyr His Gin His Lys 

4130 4135 4140 

Gin Pro Glu Val Thr Asn Pro Cys Asp Arg Lys Lys Cys Glu Trp Leu 
4145 4150 4155 4160 

Cys Leu Leu Ser Pro Ser Gly Pro Val Cys Thr Cys Pro Asn Gly Lys 

4165 4170 4175 

Arg Leu Asp Asn Gly Thr Cys Val Pro Val Pro Ser Pro Thr Pro Pro 

4180 4185 4190 

Pro Asp Ala Pro Arg Pro Gly Thr Cys Thr Leu Gin Cys Phe Asn Gly 
4195 4200 4205 
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Gly Ser Cys Phe Leu Asn Ala Arg Arg Gin Pro Lys Cys Arg Cys Gin 

4210 4215 4220 

Pro Arg Tyr Thr Gly Asp Lys Cys Glu Leu Asp Gin Cys Trp Glu Tyr 
4225 4230 4235 4240 

Cys His Asn Gly Gly Thr Cys Ala Ala Ser Pro Ser Gly Met Pro Thr 

4245 4250 4255 

Cys Arg Cys Pro Thr Gly Phe Thr Gly Pro Lys Cys Thr Ala Gin Val 

4260 4265 4270 

Cys Ala Gly Tyr Cys Ser Asn Asn Ser Thr Cys Thr Val Asn Gin Gly 

4275 4280 4285 

Asn Gin Pro Gin Cys Arg Cys Leu Pro Gly Phe Leu Gly Asp Arg Cys 

4290 4295 4300 

Gin Tyr Arg Gin Cys Ser Gly Phe Cys Glu Asn Phe Gly Thr Cys Gin 
4305 4310 4315 4320 

Met Ala Ala Asp Gly Ser Arg Gin Cys Arg Cys Thr Val Tyr Phe Glu 

4325 4330 4335 

Gly Pro Arg Cys Glu Val Asn Lys Cys Ser Arg Cys Leu Gin Gly Ala 

4340 4345 4350 

Cys Val Val Asn Lys Gin Thr Gly Asp Val Thr Cys Asn Cys Thr Asp 

4355 4360 4365 

Gly Arg Val Ala Pro Ser Cys Leu Thr Cys lie Asp His Cys Ser Asn 

4370 4375 4380 

Gly Gly Ser Cys Thr Met Asn Ser Lys Met Met Pro Glu Cys Gin Cys 
4385 4390 4395 4400 

Pro Pro His Met Thr Gly Pro Arg Cys Gin Glu Gin Val Val Ser Gin 

4405 4410 4415 

Gin Gin Pro Gly His Met Ala Ser lie Leu lie Pro Leu Leu Leu Leu 

4420 4425 4430 

Leu Leu Leu Leu Leu Val Ala Gly Val Val Phe Trp Tyr Lys Arg Arg 

4435 4440 4445 

Val Arg Gly Ala Lys Gly Phe Gin His Gin Arg Met Thr Asn Gly Ala 

4450 4455 4460 

Met Asn Val Glu lie Gly Asn Pro Thr Tyr Lys Met Tyr Glu Gly Gly 
4465 4470 4475 4480 

Glu Pro Asp Asp Val Gly Gly Leu Leu Asp Ala Asp Phe Ala Leu Asp 

4485 4490 4495 

Pro Asp Lys Pro Thr Asn Phe Thr Asn Pro Val Tyr Ala Thr Leu Tyr 

4500 4505 4510 

Met Gly Gly His Gly Ser Arg His Ser Leu Ala Ser Thr Asp Glu Lys 

4515 4520 4525 

Arg Glu Leu Leu Gly Arg Gly Pro Glu Asp Glu lie Gly Asp Pro Leu 
4530 4535 4540 

Ala 
4545 

<210> 3 

<211> 4577 

<212> DNA 

<213> Homo sapiens 

<400> 3 

gctacaatcc atctggtctc ctccagctcc ttctttctgc aacatgggga agaacaaact 60 

ccttcatcca agtctggttc ttctcctctt ggtcctcctg cccacagacg cctcagtctc 120 

tggaaaaccg cagtatatgg ttctggtccc ctccctgctc cacactgaga ccactgagaa 180 

gggctgtgtc cttctgagct acctgaatga gacagtgact gtaagtgctt ccttggagtc 240 

tgtcagggga aacaggagcc tcttcactga cctggaggcg gagaatgacg tactccactg 3 00 

tgtcgccttc gctgtcccaa agtcttcatc caatgaggag gtaatgttcc tcactgtcca 360 

agtgaaagga ccaacccaag aatttaagaa gcggaccaca gtgatggtta agaacgagga 420 
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cagtctggtc tttgtccaga cagacaaatc aatctacaaa ccagggcaga cagtgaaatt 480 

tcgtgttgtc tccatggatg aaaactttca ccccctgaat gagttgattc cactagtata 540 

cattcaggat cccaaaggaa atcgcatcgc acaatggcag agtttccagt tagagggtgg 600 

cctcaagcaa ttttcttttc ccctctcatc agagcccttc cagggctcct acaaggtggt 660 

ggtacagaag aaatcaggtg gaaggacaga gcaccctttc accgtggagg aatttgttct 72 0 

tcccaagttt gaagtacaag taacagtgcc aaagataatc accatcttgg aagaagagat 780 

gaatgtatca gtgtgtggcc tatacacata tgggaagcct gtccctggac atgtgactgt 84 0 

gagcatttgc agaaagtata gtgacgcttc cgactgccac ggtgaagatt cacaggcttt 90 0 

ctgtgagaaa ttcagtggac agctaaacag ccatggctgc ttctatcagc aagtaaaaac 96 0 

caaggtcttc cagctgaaga ggaaggagta tgaaatgaaa cttcacactg aggcccagat 102 0 

ccaagaagaa ggaacagtgg tggaattgac tggaaggcag tccagtgaaa tcacaagaac 1080 

cataaccaaa ctctcatttg tgaaagtgga ctcacacttt cgacagggaa ttcccttctt 1140 

tgggcaggtg cgcctagtag atgggaaagg cgtccctata ccaaataaag tcatattcat 1200 

cagaggaaat gaagcaaact attactccaa tgctaccacg gatgagcatg gccttgtaca 1260 

gttctctatc aacaccacca acgttatggg tacctctctt actgttaggg tcaattacaa 1320 

ggatcgtagt ccctgttacg gctaccagtg ggtgtcagaa gaacacgaag aggcacatca 13 8 0 

cactgcttat cttgtgttct ccccaagcaa gagctttgtc caccttgagc ccatgtctca 1440 

tgaactaccc tgtggccata ctcagacagt ccaggcacat tatattctga atggaggcac 1500 

cctgctgggg ctgaagaagc tctcctttta ttatctgata atggcaaagg gaggcattgt 1560 

ccgaactggg actcatggac tgcttgtgaa gcaggaagac atgaagggcc atttttccat 162 0 

ctcaatccct gtgaagtcag acattgctcc tgtcgctcgg ttgctcatct atgctgtttt 1680 

acctaccggg gacgtgattg gggattctgc aaaatatgat gttgaaaatt gtctggccaa 1740 

caaggtggat ttgagcttca gcccatcaca aagtctccca gcctcacacg cccacctgcg 1800 

agtcacagcg gctcctcagt ccgtctgcgc cctccgtgct gtggaccaaa gcgtgctgct 1860 

catgaagcct gatgctgagc tctcggcgtc ctcggtttac aacctgctac cagaaaagga 192 0 

cctcactggc ttccctgggc ctttgaatga ccaggacgat gaagactgca tcaatcgtca 198 0 

taatgtctat attaatggaa tcacatatac tccagtatca agtacaaatg aaaaggatat 2 04 0 

gtacagcttc ctagaggaca tgggcttaaa ggcattcacc aactcaaaga ttcgtaaacc 210 0 

caaaatgtgt ccacagcttc aacagtatga aatgcatgga cctgaaggtc tacgtgtagg 2160 

tttttatgag tcagatgtaa tgggaagagg ccatgcacgc ctggtgcatg ttgaagagcc 222 0 

tcacacggag accgtacgaa agtacttccc tgagacatgg atctgggatt tggtggtggt 2280 

aaactcagca ggggtggctg aggtaggagt aacagtccct gacaccatca ccgagtggaa 2340 

ggcaggggcc ttctgcctgt ctgaagatgc tggacttggt atctcttcca ctgcctctct 2400 

ccgagccttc cagcccttct ttgtggagct tacaatgcct tactctgtga ttcgtggaga 2460 

ggccttcaca ctcaaggcca cggtcctaaa ctaccttccc aaatgcatcc gggtcagtgt 2520 

gcagctggaa gcctctcccg ccttccttgc tgtcccagtg gagaaggaac aagcgcctca 2580 

ctgcatctgt gcaaacgggc ggcaaactgt gtcctgggca gtaaccccaa agtcattagg 2640 

aaatgtgaat ttcactgtga gcgcagaggc actagagtct caagagctgt gtgggactga 2700 

ggtgccttca gttcctgaac acggaaggaa agacacagtc atcaagcctc tgttggttga 2760 

acctgaagga ctagagaagg aaacaacatt caactcccta ctttgtccat caggtggtga 2 820 

ggtttctgaa gaattatccc tgaaactgcc accaaatgtg gtagaagaat ctgcccgagc 2880 

ttctgtctca gttttgggag acatattagg ctctgccatg caaaacacac aaaatcttct 2 940 

ccagatgccc tatggctgtg gagagcagaa tatggtcctc tttgctccta acatctatgt 3 000 

actggattat ctaaatgaaa cacagcagct tactccagag gtcaagtcca aggccattgg 3060 

ctatctcaac actggttacc agagacagtt gaactacaaa cactatgatg gctcctacag 3120 

cacctttggg gagcgatatg gcaggaacca gggcaacacc tggctcacag cctttgttct 3180 

gaagactttt gcccaagctc gagcctacat cttcatcgat gaagcacaca ttacccaagc 3240 

cctcatatgg ctctcccaga ggcagaagga caatggctgt ttcaggagct ctgggtcact 33 00 

gctcaacaat gccataaagg gaggagtaga agatgaagtg accctctccg cctatatcac 33 60 

catcgccctt ctggagattc ctctcacagt cactcaccct gttgtccgca atgccctgtt 342 0 

ttgcctggag tcagcctgga agacagcaca agaaggggac catggcagcc atgtatatac 3480 

caaagcactg ctggcctatg cttttgccct ggcaggtaac caggacaaga ggaaggaagt 3540 

actcaagtca cttaatgagg aagctgtgaa gaaagacaac tctgtccatt gggagcgccc 3600 

tcagaaaccc aaggcaccag tggggcattt ttacgaaccc caggctccct ctgctgaggt 3660 

ggagatgaca tcctatgtgc tcctcgctta tctcacggcc cagccagccc caacctcgga 372 0 

ggacctgacc tctgcaacca acatcgtgaa gtggatcacg aagcagcaga atgcccaggg 3780 

cggtttctcc tccacccagg acacagtggt ggctctccat gctctgtcca aatatggagc 3840 

cgccacattt accaggactg ggaaggctgc acaggtgact atccagtctt cagggacatt 3900 
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ttccagcaaa ttccaagtgg acaacaacaa tcgcctgtta ctgcagcagg tctcattgcc 3960 

agagctgcct ggggaataca gcatgaaagt gacaggagaa ggatgtgtct acctccagac 402 0 

ctccttgaaa tacaatattc tcccagaaaa ggaagagttc ccctttgctt taggagtgca 4080 

gactctgcct caaacttgtg atgaacccaa agcccacacc agcttccaaa tctccctaag 4140 

tgtcagttac acagggagcc gctctgcctc caacatggcg atcgttgatg tgaagatggt 4200 

ctctggcttc attcccctga agccaacagt gaaaatgctt gaaagatcta accatgtgag 4260 

ccggacagaa gtcagcagca accatgtctt gatttacctt gataaggtgt caaatcagac 432 0 

actgagcttg ttcttcacgg ttctgcaaga tgtcccagta agagatctca aaccagccat 43 8 0 

agtgaaagtc tatgattact acgagacgga tgagtttgca atcgctgagt acaatgctcc 444 0 

ttgcagcaaa gatcttggaa atgcttgaag accacaaggc tgaaaagtgc tttgctggag 450 0 

tcctgttctc tgagctccac agaagacacg tgtttttgta tctttaaaga cttgatgaat 4560 

aaacactttt tctggtc 4577 

<210> 4 

<211> 4422 

<212> DNA 

<213> Homo sapiens 

<400> 4 

atggggaaga acaaactcct tcatccaagt ctggttcttc tcctcttggt cctcctgccc 60 

acagacgcct cagtctctgg aaaaccgcag tatatggttc tggtcccctc cctgctccac 120 

actgagacca ctgagaaggg ctgtgtcctt ctgagctacc tgaatgagac agtgactgta 180 

agtgcttcct tggagtctgt caggggaaac aggagcctct tcactgacct ggaggcggag 24 0 

aatgacgtac tccactgtgt cgccttcgct gtcccaaagt cttcatccaa tgaggaggta 300 

atgttcctca ctgtccaagt gaaaggacca acccaagaat ttaagaagcg gaccacagtg 3 60 

atggttaaga acgaggacag tctggtcttt gtccagacag acaaatcaat ctacaaacca 42 0 

gggcagacag tgaaatttcg tgttgtctcc atggatgaaa actttcaccc cctgaatgag 480 

ttgattccac tagtatacat tcaggatccc aaaggaaatc gcatcgcaca atggcagagt 540 

ttccagttag agggtggcct caagcaattt tcttttcccc tctcatcaga gcccttccag 600 

ggctcctaca aggtggtggt acagaagaaa tcaggtggaa ggacagagca ccctttcacc 660 

gtggaggaat ttgttcttcc caagtttgaa gtacaagtaa cagtgccaaa gataatcacc 720 

atcttggaag aagagatgaa tgtatcagtg tgtggcctat acacatatgg gaagcctgtc 780 

cctggacatg tgactgtgag catttgcaga aagtatagtg acgcttccga ctgccacggt 840 

gaagattcac aggctttctg tgagaaattc agtggacagc taaacagcca tggctgcttc 900 

tatcagcaag taaaaaccaa ggtcttccag ctgaagagga aggagtatga aatgaaactt 960 

cacactgagg cccagatcca agaagaagga acagtggtgg aattgactgg aaggcagtcc 1020 

agtgaaatca caagaaccat aaccaaactc tcatttgtga aagtggactc acactttcga 1080 

cagggaattc ccttctttgg gcaggtgcgc ctagtagatg ggaaaggcgt ccctatacca 1140 

aataaagtca tattcatcag aggaaatgaa gcaaactatt actccaatgc taccacggat 1200 

gagcatggcc ttgtacagtt ctctatcaac accaccaacg ttatgggtac ctctcttact 1260 

gttagggtca attacaagga tcgtagtccc tgttacggct accagtgggt gtcagaagaa 1320 

cacgaagagg cacatcacac tgcttatctt gtgttctccc caagcaagag ctttgtccac 13 80 

cttgagccca tgtctcatga actaccctgt ggccatactc agacagtcca ggcacattat 1440 

attctgaatg gaggcaccct gctggggctg aagaagctct ccttttatta tctgataatg 1500 

gcaaagggag gcattgtccg aactgggact catggactgc ttgtgaagca ggaagacatg 15 6 0 

aagggccatt tttccatctc aatccctgtg aagtcagaca ttgctcctgt cgctcggttg 1620 

ctcatctatg ctgttttacc taccggggac gtgattgggg attctgcaaa atatgatgtt 1680 

gaaaattgtc tggccaacaa ggtggatttg agcttcagcc catcacaaag tctcccagcc 174 0 

tcacacgccc acctgcgagt cacagcggct cctcagtccg tctgcgccct ccgtgctgtg 1800 

gaccaaagcg tgctgctcat gaagcctgat gctgagctct cggcgtcctc ggtttacaac 1860 

ctgctaccag aaaaggacct cactggcttc cctgggcctt tgaatgacca ggacgatgaa 192 0 

gactgcatca atcgtcataa tgtctatatt aatggaatca catatactcc agtatcaagt 1980 

acaaatgaaa aggatatgta cagcttccta gaggacatgg gcttaaaggc attcaccaac 2 04 0 

tcaaagattc gtaaacccaa aatgtgtcca cagcttcaac agtatgaaat gcatggacct 2100 

gaaggtctac gtgtaggttt ttatgagtca gatgtaatgg gaagaggcca tgcacgcctg 2160 

gtgcatgttg aagagcctca cacggagacc gtacgaaagt acttccctga gacatggatc 2220 

tgggatttgg tggtggtaaa ctcagcaggg gtggctgagg taggagtaac agtccctgac 22 8 0 

accatcaccg agtggaaggc aggggccttc tgcctgtctg aagatgctgg acttggtatc 2340 
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tcttccactg cctctctccg agccttccag cccttctttg tggagcttac aatgccttac 2400 

tctgtgattc gtggagaggc cttcacactc aaggccacgg tcctaaacta ccttcccaaa 2460 

tgcatccggg tcagtgtgca gctggaagcc tctcccgcct tccttgctgt cccagtggag 2520 

aaggaacaag cgcctcactg catctgtgca aacgggcggc aaactgtgtc ctgggcagta 2580 

accccaaagt cattaggaaa tgtgaatttc actgtgagcg cagaggcact agagtctcaa 2640 

gagctgtgtg ggactgaggt gccttcagtt cctgaacacg gaaggaaaga cacagtcatc 2700 

aagcctctgt tggttgaacc tgaaggacta gagaaggaaa caacattcaa ctccctactt 2760 

tgtccatcag gtggtgaggt ttctgaagaa ttatccctga aactgccacc aaatgtggta 2 82 0 

gaagaatctg cccgagcttc tgtctcagtt ttgggagaca tattaggctc tgccatgcaa 2880 

aacacacaaa atcttctcca gatgccctat ggctgtggag agcagaatat ggtcctcttt 2940 

gctcctaaca tctatgtact ggattatcta aatgaaacac agcagcttac tccagaggtc 3000 

aagtccaagg ccattggcta tctcaacact ggttaccaga gacagttgaa ctacaaacac 3060 

tatgatggct cctacagcac ctttggggag cgatatggca ggaaccaggg caacacctgg 312 0 

ctcacagcct ttgttctgaa gacttttgcc caagctcgag cctacatctt catcgatgaa 3180 

gcacacatta cccaagccct catatggctc tcccagaggc agaaggacaa tggctgtttc 3240 

aggagctctg ggtcactgct caacaatgcc ataaagggag gagtagaaga tgaagtgacc 3300 

ctctccgcct atatcaccat cgcccttctg gagattcctc tcacagtcac tcaccctgtt 3360 

gtccgcaatg ccctgttttg cctggagtca gcctggaaga cagcacaaga aggggaccat 342 0 

ggcagccatg tatataccaa agcactgctg gcctatgctt ttgccctggc aggtaaccag 3480 

gacaagagga aggaagtact caagtcactt aatgaggaag ctgtgaagaa agacaactct 354 0 

gtccattggg agcgccctca gaaacccaag gcaccagtgg ggcattttta cgaaccccag 3600 

gctccctctg ctgaggtgga gatgacatcc tatgtgctcc tcgcttatct cacggcccag 3660 

ccagccccaa cctcggagga cctgacctct gcaaccaaca tcgtgaagtg gatcacgaag 3720 

cagcagaatg cccagggcgg tttctcctcc acccaggaca cagtggtggc tctccatgct 3780 

ctgtccaaat atggagccgc cacatttacc aggactggga aggctgcaca ggtgactatc 3 84 0 

cagtcttcag ggacattttc cagcaaattc caagtggaca acaacaatcg cctgttactg 3900 

cagcaggtct cattgccaga gctgcctggg gaatacagca tgaaagtgac aggagaagga 3 96 0 

tgtgtctacc tccagacctc cttgaaatac aatattctcc cagaaaagga agagttcccc 402 0 

tttgctttag gagtgcagac tctgcctcaa acttgtgatg aacccaaagc ccacaccagc 4080 

ttccaaatct ccctaagtgt cagttacaca gggagccgct ctgcctccaa catggcgatc 4140 

gttgatgtga agatggtctc tggcttcatt cccctgaagc caacagtgaa aatgcttgaa 4200 

agatctaacc atgtgagccg gacagaagtc agcagcaacc atgtcttgat ttaccttgat 4260 

aaggtgtcaa atcagacact gagcttgttc ttcacggttc tgcaagatgt cccagtaaga 4320 

gatctcaaac cagccatagt gaaagtctat gattactacg agacggatga gtttgcaatc 43 8 0 

gctgagtaca atgctccttg cagcaaagat cttggaaatg ct 4422 

<210> 5 

<211> 1474 

<212> PRT 

<213> Homo sapiens 





<400> 


5 


























Met 


Gly 


Lys 


Asn 


Lys 


Leu 


Leu 


His 


Pro 


Ser 


Leu 


Val 




Leu 


Leu 


Leu 


1 








5 










10 










15 




Val 


Leu 


Leu 


Pro 
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Val 
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Ser 


Gly 


Lys 


Pro 
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Tyr 


Met 


Val 


Leu 


Val 
35 


Pro 


Ser 


Leu 


Leu 


His 
40 


Thr 
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Thr 


Thr 


Glu 
45 


Lys 


Gly 


Cys 


Val 


Leu 
50 


Leu 


Ser 


Tyr 


Leu 


Asn 
55 


Glu 


Thr 


Val 


Thr 


Val 
60 


Ser 


Ala 


Ser 


Leu 


Glu 


Ser 


Val 


Arg 


Gly 


Asn 


Arg 


Ser 


Leu 


Phe 


Thr 


Asp 


Leu 


Glu 


Ala 


Glu 


65 










70 










75 










80 


Asn 


Asp 


Val 


Leu 


His 
85 


Cys 


Val 


Ala 


Phe 


Ala 
90 


Val 


Pro 


Lys 


Ser 


Ser 
95 


Ser 


Asn 


Glu 


Glu 


Val 
100 


Met 


Phe 


Leu 


Thr 


Val 
105 


Gin 


Val 


Lys 


Gly 


Pro 
110 


Thr 


Gin 


Glu 


Phe 


Lys 
115 


Lys 


Arg 


Thr 


Thr 


Val 
120 


Met 


Val 


Lys 


Asn 


Glu 
125 


Asp 


Ser 


Leu 
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Val Phe Val Gin 
130 

Lys Phe Arg Val 
145 

Leu lie Pro Leu 

Gin Trp Gin Ser 
180 

Pro Leu Ser Ser 
195 

Lys Lys Ser Gly 
210 

Val Leu Pro Lys 
225 

lie Leu Glu Glu 

Gly Lys Pro Val 
260 

Ser Asp Ala Ser 
275 

Lys Phe Ser Gly 
290 

Lys Thr Lys Val 
305 

His Thr Glu Ala 

Gly Arg Gin Ser 
340 

Val Lys Val Asp 
355 

Val Arg Leu Val 
370 

Phe lie Arg Gly 
385 

Glu His Gly Leu 

Thr Ser Leu Thr 
420 

Gly Tyr Gin Trp 
435 

Tyr Leu Val Phe 
450 

Ser His Glu Leu 
465 

lie Leu Asn Gly 

Tyr Leu lie Met 
500 

Leu Leu Val Lys 
515 

Pro Val Lys Ser 
530 

Val Leu Pro Thr 
545 

Glu Asn Cys Leu 

Ser Leu Pro Ala 
580 



Thr Asp Lys Ser 
135 

Val Ser Met Asp 
150 

Val Tyr lie Gin 
165 

Phe Gin Leu Glu 

Glu Pro Phe Gin 
200 

Gly Arg Thr Glu 
215 

Phe Glu Val Gin 
230 

Glu Met Asn Val 
245 

Pro Gly His Val 

Asp Cys His Gly 
280 

Gin Leu Asn Ser 
295 

Phe Gin Leu Lys 
310 

Gin lie Gin Glu 
325 

Ser Glu lie Thr 

Ser His Phe Arg 
360 

Asp Gly Lys Gly 
375 

Asn Glu Ala Asn 
390 

Val Gin Phe Ser 
405 

Val Arg Val Asn 

Val Ser Glu Glu 
440 

Ser Pro Ser Lys 
455 

Pro Cys Gly His 
470 

Gly Thr Leu Leu 
485 

Ala Lys Gly Gly 

Gin Glu Asp Met 
520 

Asp lie Ala Pro 
535 

Gly Asp Val lie 
550 

Ala Asn Lys Val 
565 

Ser His Ala His 



lie Tyr Lys Pro 
140 

Glu Asn Phe His 
155 

Asp Pro Lys Gly 
170 

Gly Gly Leu Lys 
185 

Gly Ser Tyr Lys 

His Pro Phe Thr 
220 

Val Thr Val Pro 
235 

Ser Val Cys Gly 
250 

Thr Val Ser lie 
265 

Glu Asp Ser Gin 

His Gly Cys Phe 
300 

Arg Lys Glu Tyr 
315 

Glu Gly Thr Val 
330 

Arg Thr lie Thr 
345 

Gin Gly lie Pro 

Val Pro lie Pro 
380 

Tyr Tyr Ser Asn 
395 

lie Asn Thr Thr 
410 

Tyr Lys Asp Arg 
425 

His Glu Glu Ala 

Ser Phe Val His 
460 

Thr Gin Thr Val 
475 

Gly Leu Lys Lys 
490 

lie Val Arg Thr 
505 

Lys Gly His Phe 

Val Ala Arg Leu 
540 

Gly Asp Ser Ala 
555 

Asp Leu Ser Phe 
570 

Leu Arg Val Thr 
585 



Gly Gin Thr Val 

Pro Leu Asn Glu 
160 

Asn Arg lie Ala 
175 

Gin Phe Ser Phe 
190 

Val Val Val Gin 
205 

Val Glu Glu Phe 

Lys lie lie Thr 
240 

Leu Tyr Thr Tyr 
255 

Cys Arg Lys Tyr 
270 

Ala Phe Cys Glu 
285 

Tyr Gin Gin Val 

Glu Met Lys Leu 
320 

Val Glu Leu Thr 
335 

Lys Leu Ser Phe 
350 

Phe Phe Gly Gin 
365 

Asn Lys Val lie 

Ala Thr Thr Asp 
400 

Asn Val Met Gly 
415 

Ser Pro Cys Tyr 
430 

His His Thr Ala 
445 

Leu Glu Pro Met 

Gin Ala His Tyr 
480 

Leu Ser Phe Tyr 
495 

Gly Thr His Gly 
510 

Ser lie Ser lie 
525 

Leu lie Tyr Ala 

Lys Tyr Asp Val 
560 

Ser Pro Ser Gin 
575 

Ala Ala Pro Gin 
590 
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Ser Val Cys Ala Leu Arg Ala Val Asp Gin Ser Val Leu Leu Met Lys 

595 600 605 

Pro Asp Ala Glu Leu Ser Ala Ser Ser Val Tyr Asn Leu Leu Pro Glu 

610 615 620 

Lys Asp Leu Thr Gly Phe Pro Gly Pro Leu Asn Asp Gin Asp Asp Glu 
625 630 635 640 

Asp Cys lie Asn Arg His Asn Val Tyr lie Asn Gly lie Thr Tyr Thr 

645 650 655 

Pro Val Ser Ser Thr Asn Glu Lys Asp Met Tyr Ser Phe Leu Glu Asp 

660 665 670 

Met Gly Leu Lys Ala Phe Thr Asn Ser Lys lie Arg Lys Pro Lys Met 

675 680 685 

Cys Pro Gin Leu Gin Gin Tyr Glu Met His Gly Pro Glu Gly Leu Arg 

690 695 700 

Val Gly Phe Tyr Glu Ser Asp Val Met Gly Arg Gly His Ala Arg Leu 
705 710 715 720 

Val His Val Glu Glu Pro His Thr Glu Thr Val Arg Lys Tyr Phe Pro 

725 730 735 

Glu Thr Trp lie Trp Asp Leu Val Val Val Asn Ser Ala Gly Val Ala 

740 745 750 

Glu Val Gly Val Thr Val Pro Asp Thr lie Thr Glu Trp Lys Ala Gly 

755 760 765 

Ala Phe Cys Leu Ser Glu Asp Ala Gly Leu Gly He Ser Ser Thr Ala 

770 775 780 

Ser Leu Arg Ala Phe Gin Pro Phe Phe Val Glu Leu Thr Met Pro Tyr 
785 790 795 800 

Ser Val He Arg Gly Glu Ala Phe Thr Leu Lys Ala Thr Val Leu Asn 

805 810 815 

Tyr Leu Pro Lys Cys He Arg Val Ser Val Gin Leu Glu Ala Ser Pro 

820 825 830 

Ala Phe Leu Ala Val Pro Val Glu Lys Glu Gin Ala Pro His Cys He 

835 840 845 

Cys Ala Asn Gly Arg Gin Thr Val Ser Trp Ala Val Thr Pro Lys Ser 

850 855 860 

Leu Gly Asn Val Asn Phe Thr Val Ser Ala Glu Ala Leu Glu Ser Gin 
865 870 875 880 

Glu Leu Cys Gly Thr Glu Val Pro Ser Val Pro Glu His Gly Arg Lys 

885 890 895 

Asp Thr Val He Lys Pro Leu Leu Val Glu Pro Glu Gly Leu Glu Lys 

900 905 910 

Glu Thr Thr Phe Asn Ser Leu Leu Cys Pro Ser Gly Gly Glu Val Ser 

915 920 925 

Glu Glu Leu Ser Leu Lys Leu Pro Pro Asn Val Val Glu Glu Ser Ala 

930 935 940 

Arg Ala Ser Val Ser Val Leu Gly Asp He Leu Gly Ser Ala Met Gin 
945 950 955 960 

Asn Thr Gin Asn Leu Leu Gin Met Pro Tyr Gly Cys Gly Glu Gin Asn 

965 970 975 

Met Val Leu Phe Ala Pro Asn He Tyr Val Leu Asp Tyr Leu Asn Glu 

980 985 990 

Thr Gin Gin Leu Thr Pro Glu Val Lys Ser Lys Ala He Gly Tyr Leu 

995 1000 1005 

Asn Thr Gly Tyr Gin Arg Gin Leu Asn Tyr Lys His Tyr Asp Gly Ser 

1010 1015 1020 

Tyr Ser Thr Phe Gly Glu Arg Tyr Gly Arg Asn Gin Gly Asn Thr Trp 
1025 1030 1035 1040 

Leu Thr Ala Phe Val Leu Lys Thr Phe Ala Gin Ala Arg Ala Tyr He 
1045 1050 1055 
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Phe lie Asp Glu Ala His lie Thr Gin Ala Leu lie Trp Leu Ser Gin 

1060 1065 1070 

Arg Gin Lys Asp Asn Gly Cys Phe Arg Ser Ser Gly Ser Leu Leu Asn 

1075 1080 1085 

Asn Ala lie Lys Gly Gly Val Glu Asp Glu Val Thr Leu Ser Ala Tyr 

1090 1095 1100 

He Thr He Ala Leu Leu Glu He Pro Leu Thr Val Thr His Pro Val 
1105 HIO 1H5 H20 

Val Arg Asn Ala Leu Phe Cys Leu Glu Ser Ala Trp Lys Thr Ala Gin 

1125 1130 1135 

Glu Gly Asp His Gly Ser His Val Tyr Thr Lys Ala Leu Leu Ala Tyr 

1140 1145 1150 

Ala Phe Ala Leu Ala Gly Asn Gin Asp Lys Arg Lys Glu Val Leu Lys 

1155 1160 1165 

Ser Leu Asn Glu Glu Ala Val Lys Lys Asp Asn Ser Val His Trp Glu 

1170 1175 1180 

Arg Pro Gin Lys Pro Lys Ala Pro Val Gly His Phe Tyr Glu Pro Gin 
1185 H90 H95 1200 

Ala Pro Ser Ala Glu Val Glu Met Thr Ser Tyr Val Leu Leu Ala Tyr 

1205 1210 1215 

Leu Thr Ala Gin Pro Ala Pro Thr Ser Glu Asp Leu Thr Ser Ala Thr 

1220 1225 1230 

Asn He Val Lys Trp He Thr Lys Gin Gin Asn Ala Gin Gly Gly Phe 

1235 1240 1245 

Ser Ser Thr Gin Asp Thr Val Val Ala Leu His Ala Leu Ser Lys Tyr 

1250 1255 1260 

Gly Ala Ala Thr Phe Thr Arg Thr Gly Lys Ala Ala Gin Val Thr He 
1265 1270 1275 1280 

Gin Ser Ser Gly Thr Phe Ser Ser Lys Phe Gin Val Asp Asn Asn Asn 

1285 1290 1295 

Arg Leu Leu Leu Gin Gin Val Ser Leu Pro Glu Leu Pro Gly Glu Tyr 

1300 1305 1310 

Ser Met Lys Val Thr Gly Glu Gly Cys Val Tyr Leu Gin Thr Ser Leu 

1315 1320 1325 

Lys Tyr Asn He Leu Pro Glu Lys Glu Glu Phe Pro Phe Ala Leu Gly 

1330 1335 1340 

Val Gin Thr Leu Pro Gin Thr Cys Asp Glu Pro Lys Ala His Thr Ser 
1345 1350 1355 1360 

Phe Gin He Ser Leu Ser Val Ser Tyr Thr Gly Ser Arg Ser Ala Ser 

1365 1370 1375 

Asn Met Ala He Val Asp Val Lys Met Val Ser Gly Phe He Pro Leu 

1380 1385 1390 

Lys Pro Thr Val Lys Met Leu Glu Arg Ser Asn His Val Ser Arg Thr 

1395 1400 1405 

Glu Val Ser Ser Asn His Val Leu He Tyr Leu Asp Lys Val Ser Asn 

1410 1415 1420 

Gin Thr Leu Ser Leu Phe Phe Thr Val Leu Gin Asp Val Pro Val Arg 
1425 1430 1435 1440 

Asp Leu Lys Pro Ala He Val Lys Val Tyr Asp Tyr Tyr Glu Thr Asp 

1445 1450 1455 

Glu Phe Ala He Ala Glu Tyr Asn Ala Pro Cys Ser Lys Asp Leu Gly 
1460 1465 1470 

Asn Ala 



<210> 6 
<211> 14896 
<212> DNA 
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<213> Homo sapiens 



<400> 6 

cagcggtgcg agctccaggc ccatgcactg aggaggcgga aacaagggga gcccccagag 6 0 

ctccatcaag ccccctccaa aggctcccct acccggtcca cgccccccac cccccctccc 120 

cgcctcctcc caattgtgca tttttgcagc cggaggcggc tccgagatgg ggctgtgagc 180 

ttcgcccggg gagggggaaa gagcagcgag gagtgaagcg ggggggtggg gtgaagggtt 24 0 

tggatttcgg ggcagggggc gcacccccgt cagcaggccc tccccaaggg gctcggaact 300 

ctacctcttc acccacgccc ctggtgcgct ttgccgaagg aaagaataag aacagagaag 360 

gaggaggggg aaaggaggaa aagggggacc ccccaactgg ggggggtgaa ggagagaagt 42 0 

agcaggacca gaggggaagg ggctgctgct tgcatcagcc cacaccatgc tgaccccgcc 480 

gttgctcctg ctgctgcccc tgctctcagc tctggtcgcg gcggctatcg acgcccctaa 540 

gacttgcagc cccaagcagt ttgcctgcag agatcaaata acctgtatct caaagggctg 600 

gcggtgcgac ggtgagaggg actgcccaga cggatctgac gaggcccctg agatttgtcc 660 

acagagtaag gcccagcgat gccagccaaa cgagcataac tgcctgggta ctgagctgtg 72 0 

tgttcccatg tcccgcctct gcaatggggt ccaggactgc atggacggct cagatgaggg 780 

gccccactgc cgagagctcc aaggcaactg ctctcgcctg ggctgccagc accattgtgt 840 

ccccacactc gatgggccca cctgctactg caacagcagc tttcagcttc aggcagatgg 900 

caagacctgc aaagattttg atgagtgctc agtgtacggc acctgcagcc agctatgcac 960 

caacacagac ggctccttca tatgtggctg tgttgaagga tacctcctgc agccggataa 1020 

ccgctcctgc aaggccaaga acgagccagt agaccggccc cctgtgctgt tgatagccaa 1080 

ctcccagaac atcttggcca cgtacctgag tggggcccag gtgtctacca tcacacctac 1140 

gagcacgcgg cagaccacag ccatggactt cagctatgcc aacgagaccg tatgctgggt 12 00 

gcatgttggg gacagtgctg ctcagacgca gctcaagtgt gcccgcatgc ctggcctaaa 1260 

gggcttcgtg gatgagcaca ccatcaacat ctccctcagt ctgcaccacg tggaacagat 1320 

ggccatcgac tggctgacag gcaacttcta ctttgtggat gacatcgatg ataggatctt 1380 

tgtctgcaac agaaatgggg acacatgtgt cacattgcta gacctggaac tctacaaccc 1440 

caagggcatt gccctggacc ctgccatggg gaaggtgttt ttcactgact atgggcagat 15 0 0 

cccaaaggtg gaacgctgtg acatggatgg gcagaaccgc accaagctcg tcgacagcaa 1560 

gattgtgttt cctcatggca tcacgctgga cctggtcagc cgccttgtct actgggcaga 1620 

tgcctatctg gactatattg aagtggtgga ctatgagggc aagggccgcc agaccatcat 16 80 

ccagggcatc ctgattgagc acctgtacgg cctgactgtg tttgagaatt atctctatgc 1740 

caccaactcg gacaatgcca atgcccagca gaagacgagt gtgatccgtg tgaaccgctt 1800 

taacagcacc gagtaccagg ttgtcacccg ggtggacaag ggtggtgccc tccacatcta 1860 

ccaccagagg cgtcagcccc gagtgaggag ccatgcctgt gaaaacgacc agtatgggaa 1920 

gccgggtggc tgctctgaca tctgcctgct ggccaacagc cacaaggcgc ggacctgccg 1980 

ctgccgttcc ggcttcagcc tgggcagtga cgggaagtca tgcaagaagc cggagcatga 2040 

gctgttcctc gtgtatggca agggccggcc aggcatcatc cggggcatgg atatgggggc 2100 

caaggtcccg gatgagcaca tgatccccat tgaaaacctc atgaaccccc gagccctgga 2160 

cttccacgct gagaccggct tcatctactt tgccgacacc accagctacc tcattggccg 2220 

ccagaagatt gatggcactg agcgggagac catcctgaag gacggcatcc acaatgtgga 22 80 

gggtgtggcc gtggactgga tgggagacaa tctgtactgg acggacgatg ggcccaaaaa 2340 

gacaatcagc gtggccaggc tggagaaagc tgctcagacc cgcaagactt taatcgaggg 24 0 0 

caaaatgaca caccccaggg ctattgtggt ggatccactc aatgggtgga tgtactggac 2460 

agactgggag gaggacccca aggacagtcg gcgtgggcgg ctggagaggg cgtggatgga 252 0 

tggctcacac cgagacatct ttgtcacctc caagacagtg ctttggccca atgggctaag 2580 

cctggacatc ccggctgggc gcctctactg ggtggatgcc ttctacgacc gcatcgagac 2640 

gatactgctc aatggcacag accggaagat tgtgtatgaa ggtcctgagc tgaaccacgc 2700 

ctttggcctg tgtcaccatg gcaactacct cttctggact gagtatcgga gtggcagtgt 2760 

ctaccgcttg gaacggggtg taggaggcgc accccccact gtgacccttc tgcgcagtga 282 0 

gcggcccccc atctttgaga tccgaatgta tgatgcccag cagcagcaag ttggcaccaa 2880 

caaatgccgg gtgaacaatg gcggctgcag cagcctgtgc ttggccaccc ctgggagccg 2940 

ccagtgcgcc tgtgctgagg accaggtgtt ggacgcagac ggcgtcactt gcttggcgaa 3000 

cccatcctac gtgcctccac cccagtgcca gccaggcgag tttgcctgtg ccaacagccg 3060 

ctgcatccag gagcgctgga agtgtgacgg agacaacgat tgcctggaca acagtgatga 312 0 

ggccccagcc ctctgccatc agcacacctg cccctcggac cgattcaagt gcgagaacaa 3180 

ccggtgcatc cccaaccgct ggctctgcga cggggacaat gactgtggga acagtgaaga 3240 

tgagtccaat gccacttgtt cagcccgcac ctgccccccc aaccagttct cctgtgccag 3300 
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tggccgctgc atccccatct cctggacgtg tgatctggat gacgactgtg gggaccgctc 3 3 60 

tgatgagtct gcttcgtgtg cctatcccac ctgcttcccc ctgactcagt ttacctgcaa 3420 

caatggcaga tgtatcaaca tcaactggag atgcgacaat gacaatgact gtggggacaa 3480 

cagtgacgaa gccggctgca gccactcctg ttctagcacc cagttcaagt gcaacagcgg 3 54 0 

gcgttgcatc cccgagcact ggacctgcga tggggacaat gactgcggag actacagtga 3 600 

tgagacacac gccaactgca ccaaccaggc cacgaggccc cctggtggct gccacactga 3 660 

tgagttccag tgccggctgg atggactatg catccccctg cggtggcgct gcgatgggga 3 72 0 

cactgactgc atggactcca gcgatgagaa gagctgtgag ggagtgaccc acgtctgcga 3780 

tcccagtgtc aagtttggct gcaaggactc agctcggtgc atcagcaaag cgtgggtgtg 3 840 

tgatggcgac aatgactgtg aggataactc ggacgaggag aactgcgagt ccctggcctg 3 900 

caggccaccc tcgcaccctt gtgccaacaa cacctcagtc tgcctgcccc ctgacaagct 3960 

gtgtgatggc aacgacgact gtggcgacgg ctcagatgag ggcgagctct gcgaccagtg 4 02 0 

ctctctgaat aacggtggct gcagccacaa ctgctcagtg gcacctggcg aaggcattgt 4080 

gtgttcctgc cctctgggca tggagctggg gcccgacaac cacacctgcc agatccagag 4140 

ctactgtgcc aagcatctca aatgcagcca aaagtgcgac cagaacaagt tcagcgtgaa 42 00 

gtgctcctgc tacgagggct gggtcctgga acctgacggc gagagctgcc gcagcctgga 4260 

ccccttcaag ccgttcatca ttttctccaa ccgccatgaa atccggcgca tcgatcttca 4320 

caaaggagac tacagcgtcc tggtgcccgg cctgcgcaac accatcgccc tggacttcca 43 80 

cctcagccag agcgccctct actggaccga cgtggtggag gacaagatct accgcgggaa 4440 

gctgctggac aacggagccc tgactagttt cgaggtggtg attcagtatg gcctggccac 45 00 

acccgagggc ctggctgtag actggattgc aggcaacatc tactgggtgg agagtaacct 4560 

ggatcagatc gaggtggcca agctggatgg gaccctccgg accaccctgc tggccggtga 4620 

cattgagcac ccaagggcaa tcgcactgga tccccgggat gggatcctgt tttggacaga 4680 

ctgggatgcc agcctgcccc gcattgaggc agcctccatg agtggggctg ggcgccgcac 4740 

cgtgcaccgg gagaccggct ctgggggctg gcccaacggg ctcaccgtgg actacctgga 4800 

gaagcgcatc ctttggattg acgccaggtc agatgccatt tactcagccc gttacgacgg 4860 

ctctggccac atggaggtgc ttcggggaca cgagttcctg tcgcacccgt ttgcagtgac 4920 

gctgtacggg ggggaggtct actggactga ctggcgaaca aacacactgg ctaaggccaa 4980 

caagtggacc ggccacaatg tcaccgtggt acagaggacc aacacccagc cctttgacct 5040 

gcaggtgtac cacccctccc gccagcccat ggctcccaat ccctgtgagg ccaatggggg 5100 

ccagggcccc tgctcccacc tgtgtctcat caactacaac cggaccgtgt cctgcgcctg 5160 

cccccacctc atgaagctcc acaaggacaa caccacctgc tatgagttta agaagttcct 5220 

gctgtacgca cgtcagatgg agatccgagg tgtggacctg gatgctccct actacaacta 5280 

catcatctcc ttcacggtgc ccgacatcga caacgtcaca gtgctagact acgatgcccg 5340 

cgagcagcgt gtgtactggt ctgacgtgcg gacacaggcc atcaagcggg ccttcatcaa 5400 

cggcacaggc gtggagacag tcgtctctgc agacttgcca aatgcccacg ggctggctgt 5460 

ggactgggtc tcccgaaacc tgttctggac aagctatgac accaataaga agcagatcaa 5520 

tgtggcccgg ctggatggct ccttcaagaa cgcagtggtg cagggcctgg agcagcccca 55 80 

tggccttgtc gtccaccctc tgcgtgggaa gctctactgg accgatggtg acaacatcag 5640 

catggccaac atggatggca gcaatcgcac cctgctcttc agtggccaga agggccccgt 57 0 0 

gggcctggct attgacttcc ctgaaagcaa actctactgg atcagctccg ggaaccatac 5760 

catcaaccgc tgcaacctgg atgggagtgg gctggaggtc atcgatgcca tgcggagcca 5820 

gctgggcaag gccaccgccc tggccatcat gggggacaag ctgtggtggg ctgatcaggt 5880 

gtcggaaaag atgggcacat gcagcaaggc tgacggctcg ggctccgtgg tccttcggaa 5940 

cagcaccacc ctggtgatgc acatgaaggt ctatgacgag agcatccagc tggaccataa 6000 

gggcaccaac ccctgcagtg tcaacaacgg tgactgctcc cagctctgcc tgcccacgtc 6060 

agagacgacc cgctcctgca tgtgcacagc cggctatagc ctccggagtg gccagcaggc 612 0 

ctgcgagggc gtaggttcct ttctcctgta ctctgtgcat gagggaatca ggggaattcc 6180 

cctggatccc aatgacaagt cagatgccct ggtcccagtg tccgggacct cgctggctgt 624 0 

cggcatcgac ttccacgctg aaaatgacac catctactgg gtggacatgg gcctgagcac 63 0 0 

gatcagccgg gccaagcggg accagacgtg gcgtgaagac gtggtgacca atggcattgg 63 6 0 

ccgtgtggag ggcattgcag tggactggat cgcaggcaac atctactgga cagaccaggg 642 0 

ctttgatgtc atcgaggtcg cccggctcaa tggctccttc cgctacgtgg tgatctccca 64 8 0 

gggtctagac aagccccggg ccatcaccgt ccacccggag aaagggtact tgttctggac 654 0 

tgagtggggt cagtatccgc gtattgagcg gtctcggcta gatggcacgg agcgtgtggt 6600 

gctggtcaac gtcagcatca gctggcccaa cggcatctca gtggactacc aggatgggaa 6660 

gctgtactgg tgcgatgcac ggacagacaa gattgaacgg atcgacctgg agacaggtga 672 0 

gaaccgcgag gtggttctgt ccagcaacaa catggacatg ttttcagtgt ctgtgtttga 6780 
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ggatttcatc tactggagtg acaggactca tgccaacggc tctatcaagc gcgggagcaa 6840 

agacaatgcc acagactccg tgcccctgcg aaccggcatc ggcgtccagc ttaaagacat 690 0 

caaagtcttc aaccgggacc ggcagaaagg caccaacgtg tgcgcggtgg ccaatggcgg 6960 

gtgccagcag ctgtgcctgt accggggccg tgggcagcgg gcctgcgcct gtgcccacgg 7 02 0 

gatgctggct gaagacggag catcgtgccg cgagtatgcc ggctacctgc tctactcaga 7080 

gcgcaccatt ctcaagagta tccacctgtc ggatgagcgc aacctcaatg cgcccgtgca 7140 

gcccttcgag gaccctgagc acatgaagaa cgtcatcgcc ctggcctttg actaccgggc 7200 

aggcacctct ccgggcaccc ccaatcgcat cttcttcagc gacatccact ttgggaacat 7260 

ccaacagatc aacgacgatg gctccaggag gatcaccatt gtggaaaacg tgggctccgt 7320 

ggaaggcctg gcctatcacc gtggctggga cactctctat tggacaagct acacgacatc 7380 

caccatcacg cgccacacag tggaccagac ccgcccaggg gccttcgagc gtgagaccgt 7440 

catcactatg tctggagatg accacccacg ggccttcgtt ttggacgagt gccagaacct 7500 

catgttctgg accaactgga atgagcagca tcccagcatc atgcgggcgg cgctctcggg 7560 

agccaatgtc ctgaccctta tcgagaagga catccgtacc cccaatggcc tggccatcga 7620 

ccaccgtgcc gagaagctct acttctctga cgccaccctg gacaagatcg agcggtgcga 7680 

gtatgacggc tcccaccgct atgtgatcct aaagtcagag cctgtccacc ccttcgggct 7740 

ggccgtgtat ggggagcaca ttttctggac tgactgggtg cggcgggcag tgcagcgggc 78 00 

caacaagcac gtgggcagca acatgaagct gctgcgcgtg gacatccccc agcagcccat 7860 

gggcatcatc gccgtggcca acgacaccaa cagctgtgaa ctctctccat gccgaatcaa 7920 

caacggtggc tgccaggacc tgtgtctgct cactcaccag ggccatgtca actgctcatg 7980 

ccgagggggc cgaatcctcc aggatgacct cacctgccga gcggtgaatt cctcttgccg 8040 

agcacaagat gagtttgagt gtgccaatgg cgagtgcatc aacttcagcc tgacctgcga 8100 

cggcgtcccc cactgcaagg acaagtccga tgagaagcca tcctactgca actcccgccg 8160 

ctgcaagaag actttccggc agtgcagcaa tgggcgctgt gtgtccaaca tgctgtggtg 822 0 

caacggggcc gacgactgtg gggatggctc tgacgagatc ccttgcaaca agacagcctg 82 80 

tggtgtgggc gagttccgct gccgggacgg gacctgcatc gggaactcca gccgctgcaa 8340 

ccagtttgtg gattgtgagg acgcctcaga tgagatgaac tgcagtgcca ccgactgcag 8400 

cagctacttc cgcctgggcg tgaagggcgt gctcttccag ccctgcgagc ggacctcact 8460 

ctgctacgca cccagctggg tgtgtgatgg cgccaatgac tgtggggact acagtgatga 852 0 

gcgcgactgc ccaggtgtga aacgccccag atgccctctg aattacttcg cctgccctag 8580 

tgggcgctgc atccccatga gctggacgtg tgacaaagag gatgactgtg aacatggcga 8640 

ggacgagacc cactgcaaca agttctgctc agaggcccag tttgagtgcc agaaccatcg 8700 

ctgcatctcc aagcagtggc tgtgtgacgg cagcgatgac tgtggggatg gctcagacga 8760 

ggctgctcac tgtgaaggca agacgtgcgg cccctcctcc ttctcctgcc ctggcaccca 8820 

cgtgtgcgtc cccgagcgct ggctctgtga cggtgacaaa gactgtgctg atggtgcaga 8880 

cgagagcatc gcagctggtt gcttgtacaa cagcacttgt gacgaccgtg agttcatgtg 8940 

ccagaaccgc cagtgcatcc ccaagcactt cgtgtgtgac cacgaccgtg actgtgcaga 9000 

tggctctgat gagtcccccg agtgtgagta cccgacctgc ggccccagtg agttccgctg 9060 

tgccaatggg cgctgtctga gctcccgcca gtgggagtgt gatggcgaga atgactgcca 9120 

cgaccagagt gacgaggctc ccaagaaccc acactgcacc agcccagagc acaagtgcaa 9180 

tgcctcgtca cagttcctgt gcagcagtgg gcgctgtgtg gctgaggcac tgctctgcaa 924 0 

cggccaggat gactgtggcg acagctcgga cgagcgtggc tgccacatca atgagtgtct 93 0 0 

cagccgcaag ctcagtggct gcagccagga ctgtgaggac ctcaagatcg gcttcaagtg 9360 

ccgctgtcgc cctggcttcc ggctgaagga tgacggccgg acgtgtgctg atgtggacga 942 0 

gtgcagcacc accttcccct gcagccagcg ctgcatcaac acccatggca gctataagtg 9480 

tctgtgtgtg gagggctatg caccccgcgg cggcgacccc cacagctgca aggctgtgac 954 0 

tgacgaggaa ccgtttctga tcttcgccaa ccggtactac ctgcgcaagc tcaacctgga 960 0 

cgggtccaac tacacgttac ttaagcaggg cctgaacaac gccgttgcct tggattttga 9660 

ctaccgagag cagatgatct actggacaga tgtgaccacc cagggcagca tgatccgaag 972 0 

gatgcacctt aacgggagca atgtgcaggt cctacaccgt acaggcctca gcaaccccga 97 8 0 

tgggctggct gtggactggg tgggtggcaa cctgtactgg tgcgacaaag gccgggacac 9840 

catcgaggtg tccaagctca atggggccta tcggacggtg ctggtcagct ctggcctccg 9900 

tgagcccagg gctctggtgg tggatgtgca gaatgggtac ctgtactgga cagactgggg 9960 

tgaccattca ctgatcggcc gcatcggcat ggatgggtcc agccgcagcg tcatcgtgga 1002 0 

caccaagatc acatggccca atggcctgac gctggactat gtcactgagc gcatctactg 10080 

ggccgacgcc cgcgaggact acattgaatt tgccagcctg gatggctcca atcgccacgt 10140 

tgtgctgagc caggacatcc cgcacatctt tgcactgacc ctgtttgagg actacgtcta 102 00 

ctggaccgac tgggaaacaa agtccattaa ccgagcccac aagaccacgg gcaccaacaa 10260 
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aacgctcctc atcagcacgc tgcaccggcc catggacctg catgtcttcc atgccctgcg 10320 

ccagccagac gtgcccaatc acccctgcaa ggtcaacaat ggtggctgca gcaacctgtg 10380 

cctgctgtcc cccgggggag ggcacaaatg tgcctgcccc accaacttct acctgggcag 10440 

cgatgggcgc acctgtgtgt ccaactgcac ggctagccag tttgtatgca agaacgacaa 10500 

gtgcatcccc ttctggtgga agtgtgacac cgaggacgac tgcggggacc actcagacga 10560 

gcccccggac tgccctgagt tcaagtgccg gcccggacag ttccagtgct ccacaggtat 10620 

ctgcacaaac cctgccttca tctgcgatgg cgacaatgac tgccaggaca acagtgacga 10680 

ggccaactgt gacatccacg tctgcttgcc cagtcagttc aaatgcacca acaccaaccg 10740 

ctgtattccc ggcatcttcc gctgcaatgg gcaggacaac tgcggagatg gggaggatga 10 8 00 

gagggactgc cccgaggtga cctgcgcccc caaccagttc cagtgctcca ttaccaaacg 10860 

gtgcatcccc cgggtctggg tctgcgaccg ggacaatgac tgtgtggatg gcagtgatga 10920 

gcccgccaac tgcacccaga tgacctgtgg tgtggacgag ttccgctgca aggattcggg 10 980 

ccgctgcatc ccagcgcgtt ggaagtgtga cggagaggat gactgtgggg atggctcgga 11040 

tgagcccaag gaagagtgtg atgaacgcac ctgtgagcca taccagttcc gctgcaagaa 11100 

caaccgctgc gtgcccggcc gctggcagtg cgactacgac aacgattgcg gtgacaactc 11160 

cgatgaagag agctgcaccc ctcggccctg ctccgagagt gagttctcct gtgccaacgg 1122 0 

ccgctgcatc gcggggcgct ggaaatgcga tggagaccac gactgcgcgg acggctcgga 11280 

cgagaaagac tgcacccccc gctgtgacat ggaccagttc cagtgcaaga gcggccactg 11340 

catccccctg cgctggcgct gtgacgcaga cgccgactgc atggacggca gcgacgagga 11400 

ggcctgcggc actggcgtgc ggacctgccc cctggacgag ttccagtgca acaacacctt 11460 

gtgcaagccg ctggcctgga agtgcgatgg cgaggatgac tgtggggaca actcagatga 11520 

gaaccccgag gagtgtgccc ggttcgtgtg ccctcccaac cggcccttcc gttgcaagaa 115 80 

tgaccgcgtc tgtctgtgga tcgggcgcca atgcgatggc acggacaact gtggggatgg 1164 0 

gactgatgaa gaggactgtg agccccccac agcccacacc acccactgca aagacaagaa 117 00 

ggagtttctg tgccggaacc agcgctgcct ctcctcctcc ctgcgctgca acatgttcga 11760 

tgactgcggg gacggctctg acgaggagga ctgcagcatc gaccccaagc tgaccagctg 11820 

cgccaccaat gccagcatct gtggggacga ggcacgctgc gtgcgcaccg agaaagcggc 118 80 

ctactgtgcc tgccgctcgg gcttccacac cgtgcccggc cagcccggat gccaagacat 11940 

caacgagtgc ctgcgcttcg gcacctgctc ccagctctgc aacaacacca agggcggcca 12000 

cctctgcagc tgcgctcgga acttcatgaa gacgcacaac acctgcaagg ccgaaggctc 12060 

tgagtaccag gtcctgtaca tcgctgatga caatgagatc cgcagcctgt tccccggcca 12120 

cccccattcg gcttacgagc aggcattcca gggtgacgag agtgtccgca ttgatgctat 12180 

ggatgtccat gtcaaggctg gccgtgtcta ttggaccaac tggcacacgg gcaccatctc 12240 

ctaccgcagc ctgccacctg ctgcgcctcc taccacttcc aaccgccacc ggcgacagat 12300 

tgaccggggt gtcacccacc tcaacatttc agggctgaag atgcccagag gcatcgccat 123 60 

cgactgggtg gccggaaacg tgtactggac cgactcgggc cgagatgtga ttgaggtggc 124 2 0 

gcagatgaag ggcgagaacc gcaagacgct catctcgggc atgattgacg agccccacgc 124 80 

cattgtggtg gacccactga gggggaccat gtactggtca gactggggca accaccccaa 12540 

gattgagacg gcagcgatgg atgggacgct tcgggagaca ctggtgcagg acaacattca 12600 

gtggcccaca ggcctggccg tggattatca caatgagcgg ctgtactggg cagacgccaa 12660 

gctttcagtc atcggcagca tccggctcaa tggcacggac cccattgtgg ctgctgacag 12720 

caaacgaggc ctaagtcacc ccttcagcat cgacgtcttt gaggattaca tctatggtgt 12780 

cacctacatc aataatcgtg tcttcaagat ccataagttt ggccacagcc ccttggtcaa 12840 

cctgacaggg ggcctgagcc acgcctctga cgtggtcctt taccatcagc acaagcagcc 12900 

cgaagtgacc aacccatgtg accgcaagaa atgcgagtgg ctctgcctgc tgagccccag 12960 

tgggcctgtc tgcacctgtc ccaatgggaa gcggctggac aacggcacat gcgtgcctgt 1302 0 

gccctctcca acgccccccc cagatgctcc ccggcctgga acctgtaacc tgcagtgctt 13080 

caacggtggc agctgtttcc tcaatgcacg gaggcagccc aagtgccgct gccaaccccg 1314 0 

ctacacgggt gacaagtgtg aactggacca gtgctgggag cactgtcgca atgggggcac 13200 

ctgtgctgcc tccccctctg gcatgcccac gtgccggtgc cccacgggct tcacgggccc 132 60 

caaatgcacc cagcaggtgt gtgcgggcta ctgtgccaac aacagcacct gcactgtcaa 13320 

ccagggcaac cagccccagt gccgatgcct acccggcttc ctgggcgacc gctgccagta 13380 

ccggcagtgc tctggctact gtgagaactt tggcacatgc cagatggctg ctgatggctc 1344 0 

ccgacaatgc cgctgcactg cctactttga gggatcgagg tgtgaggtga acaagtgcag 13500 

ccgctgtctc gaaggggcct gtgtggtcaa caagcagagt ggggatgtca cctgcaactg 13560 

cacggatggc cgggtggccc ccagctgtct gacctgcgtc ggccactgca gcaatggcgg 13620 

ctcctgtacc atgaacagca aaatgatgcc tgagtgccag tgcccacccc acatgacagg 13680 

gccccggtgt gaggagcacg tcttcagcca gcagcagcca ggacatatag cctccatcct 13740 
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aatccctctg ctgttgctgc tgctgctggt tctggtggcc ggagtggtat tctggtataa 13800 

gcggcgagtc caaggggcta agggcttcca gcaccaacgg atgaccaacg gggccatgaa 13 860 

cgtggagatt ggaaacccca cctacaagat gtacgaaggc ggagagcctg atgatgtggg 13 92 0 

aggcctactg gacgctgact ttgccctgga ccctgacaag cccaccaact tcaccaaccc 13980 

cgtgtatgcc acactctaca tggggggcca tggcagtcgc cactccctgg ccagcacgga 14040 

cgagaagcga gaactcctgg gccggggccc tgaggacgag ataggggacc ccttggcata 14100 

gggccctgcc ccgtcggact gcccccagaa agcctcctgc cccctgccgg tgaagtcctt 14160 

cagtgagccc ctccccagcc agcccttccc tggccccgcc ggatgtataa atgtaaaaat 14220 

gaaggaatta cattttatat gtgagcgagc aagccggcaa gcgagcacag tattatttct 142 80 

ccatcccctc cctgcctgct ccttggcacc cccatgctgc cttcagggag acaggcaggg 14340 

agggcttggg gctgcacctc ctaccctccc accagaacgc accccactgg gagagctggt 14400 

ggtgcagcct tcccctccct gtataagaca ctttgccaag gctctcccct ctcgccccat 14460 

ccctgcttgc ccgctcccac agcttcctga gggctaattc tgggaaggga gagttctttg 14520 

ctgcccctgt ctggaagacg tggctctggg tgaggtaggc gggaaaggat ggagtgtttt 14580 

agttcttggg ggaggccacc ccaaacccca gccccaactc caggggcacc tatgagatgg 14640 

ccatgctcaa cccccctccc agacaggccc tccctgtctc cagggccccc accgaggttc 14700 

ccagggctgg agacttcctc tggtaaacat tcctccagcc tcccctcccc tggggacgcc 14760 

aaggaggtgg gccacaccca ggaagggaaa gcgggcagcc ccgttttggg gacgtgaacg 14820 

ttttaataat ttttgctgaa ttctttacaa ctaaataaca cagatattct tataaataaa 14880 

attgtaaaaa aaaaaa 14896 



<210> 7 

<211> 126 

<212> PRT 

<213> Homo sapiens 



<400> 7 
lie Ala Leu Asp Phe 

1 5 
Val Val Glu Asp Lys 
20 

Leu Thr Ser Phe Glu 
35 

Gly Leu Ala Val Asp 
50 

Asn Leu Asp Gin lie 
65 

Thr Leu Leu Ala Gly 
85 

Pro Arg Asp Gly lie 
100 

Arg lie Glu Ala Ala 
115 



His Leu Ser Gin Ser Ala 
10 

lie Tyr Arg Gly Lys Leu 
25 

Val Val He Gin Tyr Gly 
40 

Trp He Ala Gly Asn He 
55 

Glu Val Ala Lys Leu Asp 
70 75 
Asp He Glu His Pro Arg 
90 

Leu Phe Trp Thr Asp Trp 
105 

Ser Met Ser Gly Ala Gly 
120 



Leu Tyr Trp Thr Asp 
15 

Leu Asp Asn Gly Ala 
30 

Leu Ala Thr Pro Glu 
45 

Tyr Trp Val Glu Ser 
60 

Gly Thr Leu Arg Thr 
80 

Ala He Ala Leu Asp 
95 

Asp Ala Ser Leu Pro 
110 

Arg Arg Thr 
125 



<210> 8 

<211> 153 

<212> PRT 

<213> Homo sapiens 



<400> 8 
Leu Leu Gin Gin 
1 

Lys Val Thr Gly 
20 

Asn He Leu Pro 
35 



Val Ser Leu Pro 
5 

Glu Gly Cys Val 

Glu Lys Glu Glu 
40 



Glu Leu Pro Gly 
10 

Tyr Leu Gin Thr 
25 

Phe Pro Phe Ala 



Glu Tyr Ser Met 
15 

Ser Leu Lys Tyr 
30 

Leu Gly Val Gin 
45 
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Thr Leu Pro Gin Thr Cys Asp Glu Pro Lys Ala His Thr Ser Phe Gin 

50 55 60 

lie Ser Leu Ser Val Ser Tyr Thr Gly Ser Arg Ser Ala Ser Asn Met 
65 70 75 80 

Ala He Val Asp Val Lys Met Val Ser Gly Phe He Pro Leu Lys Pro 

85 90 95 

Thr Val Lys Met Leu Glu Arg Ser Asn His Val Ser Arg Thr Glu Val 

100 105 110 

Ser Ser Asn His Val Leu He Tyr Leu Asp Lys Val Ser Asn Gin Thr 

115 120 125 

Leu Ser Leu Phe Phe Thr Val Leu Gin Asp Val Pro Val Arg Asp Leu 

130 135 140 

Lys Pro Ala He Val Lys Val Tyr Asp 
145 150 

<210> 9 
<211> 138 
<212> PRT 

<213> Homo sapiens 
<400> 9 

Met Lys Val Thr Gly Glu Gly Cys Val Tyr Leu Gin Thr Ser Leu Lys 

15 10 15 

Tyr Asn He Leu Pro Glu Lys Glu Glu Phe Pro Phe Ala Leu Gly Val 

20 25 30 

Gin Thr Leu Pro Gin Thr Cys Asp Glu Pro Lys Ala His Thr Ser Phe 

35 40 45 

Gin He Ser Leu Ser Val Ser Tyr Thr Gly Ser Arg Ser Ala Ser Asn 

50 55 60 

Met Ala He Val Asp Val Lys Met Val Ser Gly Phe He Pro Leu Lys 
65 70 75 80 

Pro Thr Val Lys Met Leu Glu Arg Ser Asn His Val Ser Arg Thr Glu 

85 90 95 

Val Ser Ser Asn His Val Leu He Tyr Leu Asp Lys Val Ser Asn Gin 

100 105 110 

Thr Leu Ser Leu Phe Phe Thr Val Leu Gin Asp Val Pro Val Arg Asp 

115 120 125 

Leu Lys Pro Ala He Val Lys Val Tyr Asp 
130 135 

<210> 10 
<211> 27 
<212> PRT 

<213> Homo sapiens 
<400> 10 

Ser Val Ser Tyr Thr Gly Ser Arg Ser Ala Ser Asn Met Ala He Val 

15 10 15 

Asp Val Lys Met Val Ser Gly Phe He Pro Leu 
20 25 

<210> 11 

<211> 126 

<212> PRT 

<213> Homo sapiens 

<400> 11 
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Leu Gin Gin Val 

1 

Val Thr Gly Glu 
20 

He Leu Pro Glu 
35 

Leu Pro Gin Thr 
50 

Ser Leu Ser Val 
65 

He Val Asp Val 

Val Lys Met Leu 
100 

Ser Asn His Val 
115 



Ser Leu Pro Glu 
5 

Gly Cys Val Tyr 

Lys Glu Glu Phe 
40 

Cys Asp Glu Pro 
55 

Ser Tyr Thr Gly 
70 

Lys Met Val Ser 
85 

Glu Arg Ser Asn 

Leu He Tyr Leu 
120 



Leu Pro Gly Glu 
10 

Leu Gin Thr Ser 
25 

Pro Phe Ala Leu 

Lys Ala His Thr 
60 

Ser Arg Ser Ala 
75 

Gly Phe He Pro 
90 

His Val Ser Arg 
105 

Asp Lys Val Ser 



Tyr Ser Met Lys 
15 

Leu Lys Tyr Asn 
30 

Gly Val Gin Thr 
45 

Ser Phe Gin He 

Ser Asn Met Ala 
80 

Leu Lys Pro Thr 
95 

Thr Glu Val Ser 

110 
Asn Gin 
125 



<210> 12 
<211> 111 
<212> PRT 

<213> Homo sapiens 



<400> 12 
Leu Gin Gin Val 
1 

Val Thr Gly Glu 
20 

He Leu Pro Glu 
35 

Leu Pro Gin Thr 
50 

Ser Leu Ser Val 
65 

He Val Asp Val 

Val Lys Met Leu 
100 



Ser Leu Pro Glu 
5 

Gly Cys Val Tyr 

Lys Glu Glu Phe 
40 

Cys Asp Glu Pro 
55 

Ser Tyr Thr Gly 
70 

Lys Met Val Ser 
85 

Glu Arg Ser Asn 



Leu Pro Gly Glu 
10 

Leu Gin Thr Ser 
25 

Pro Phe Ala Leu 

Lys Ala His Thr 
60 

Ser Arg Ser Ala 
75 

Gly Phe He Pro 
90 

His Val Ser Arg 

105 



Tyr Ser Met Lys 

15 

Leu Lys Tyr Asn 
30 

Gly Val Gin Thr 
45 

Ser Phe Gin He 

Ser Asn Met Ala 
80 

Leu Lys Pro Thr 
95 

Thr Glu Val 
110 



<210> 13 

<211> 81 

<212> PRT 

<213> Homo sapie: 



<400> 13 
Leu Gin Gin Val Ser 

1 5 
Val Thr Gly Glu Gly 
20 

He Leu Pro Glu Lys 
35 

Leu Pro Gin Thr Cys 
50 

Ser Leu Ser Val Ser 

65 

He 



Leu Pro Glu Leu Pro Gly 
10 

Cys Val Tyr Leu Gin Thr 
25 

Glu Glu Phe Pro Phe Ala 
40 

Asp Glu Pro Lys Ala His 
55 

Tyr Thr Gly Ser Arg Ser 
70 75 



Glu Tyr Ser Met Lys 
15 

Ser Leu Lys Tyr Asn 
30 

Leu Gly Val Gin Thr 
45 

Thr Ser Phe Gin He 
60 

Ala Ser Asn Met Ala 
80 



<210> 14 
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<211> 101 
<212> PRT 

<213> Homo sapiens 



Gin 


Thr 


Ser 


Leu 


Lys 


Tyr 


Asn 


He 


Leu 


Pro 


Glu 


Lys 


Glu 


Glu 


Phe 


Pro 


1 








5 










10 










15 




Phe 


Ala 


Leu Gly 


Val 


Gin 


Thr 


Leu 


Pro 


Gin 


Thr 


Cys 


Asp 


Glu 


Pro 


Lys 








20 










25 










30 






Ala 


His 


Thr 


Ser 


Phe 


Gin 


He 


Ser 


Leu 


Ser 


Val 


Ser 


Tyr 


Thr 


Gly 


Ser 






35 










40 










45 








Arg 


Ser 


Ala 


Ser 


Asn 


Met 


Ala 


He 


Val 


Asp 


Val 


Lys 


Met 


Val 


Ser 


Gly 




50 










55 










60 










Phe 


He 


Pro 


Leu 


Lys 


Pro 


Thr 


Val 


Lys 


Met 


Leu 


Glu 


Arg 


Ser 


Asn 


His 


65 










70 










75 










80 


Val 


Ser 


Arg 


Thr 


Glu 


Val 


Ser 


Ser 


Asn 


His 


Val 


Leu 


He 


Tyr 


Leu 


Asp 










85 










90 










95 




Lys 


Val 


Ser 


Asn 


Gin 

























100 

<210> 15 
<211> 76 
<212> PRT 

<213> Homo sapiens 
<400> 15 

Gin Thr Ser Leu Lys Tyr Asn He Leu Pro Glu Lys Glu Glu Phe Pro 

15 10 15 

Phe Ala Leu Gly Val Gin Thr Leu Pro Gin Thr Cys Asp Glu Pro Lys 

20 25 30 

Ala His Thr Ser Phe Gin He Ser Leu Ser Val Ser Tyr Thr Gly Ser 

35 40 45 

Arg Ser Ala Ser Asn Met Ala He Val Asp Val Lys Met Val Ser Gly 

50 55 60 

Phe He Pro Leu Lys Pro Thr Val Lys Met Leu Glu 
65 70 75 

<210> 16 

<211> 56 

<212> PRT 

<213> Homo sapiens 

<400> 16 

Gin Thr Ser Leu Lys Tyr Asn He Leu Pro Glu Lys Glu Glu Phe Pro 

15 10 15 

Phe Ala Leu Gly Val Gin Thr Leu Pro Gin Thr Cys Asp Glu Pro Lys 

20 25 30 

Ala His Thr Ser Phe Gin He Ser Leu Ser Val Ser Tyr Thr Gly Ser 

35 40 45 

Arg Ser Ala Ser Asn Met Ala He 
50 55 

<210> 17 
<211> 76 
<212> PRT 

<213> Homo sapiens 
<400> 17 
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Gin Thr Cys Asp Glu 

1 5 
Ser Val Ser Tyr Thr 
20 

Asp Val Lys Met Val 
35 

Met Leu Glu Arg Ser 
50 

His Val Leu lie Tyr 
65 



Pro Lys Ala His Thr Ser 
10 

Gly Ser Arg Ser Ala Ser 
25 

Ser Gly Phe lie Pro Leu 
40 

Asn His Val Ser Arg Thr 
55 

Leu Asp Lys Val Ser Asn 
70 75 



Phe Gin He Ser Leu 
15 

Asn Met Ala He Val 
30 

Lys Pro Thr Val Lys 
45 

Glu Val Ser Ser Asn 

60 

Gin 



<210> 18 

<211> 76 

<212> PRT 

<213> Homo sapiens 



<400> 18 

Gin Thr Cys Asp Glu Pro Lys Ala His Thr Ser Phe Gin He Ser Leu 

15 10 15 

Ser Val Ser Tyr Thr Gly Ser Arg Ser Ala Ser Asn Met Ala He Val 

20 25 30 

Asp Val Lys Met Val Ser Gly Phe He Pro Leu Lys Pro Thr Val Lys 

35 40 45 

Met Leu Glu Arg Ser Asn His Val Ser Arg Thr Glu Val Ser Ser Asn 

50 55 60 

His Val Leu He Tyr Leu Asp Lys Val Ser Asn Gin 
65 70 75 



<210> 19 

<211> 31 

<212> PRT 

<213> Homo sapiens 



<400> 19 

Gin Thr Cys Asp Glu Pro Lys Ala His Thr Ser Phe Gin He Ser Leu 

15 10 15 

Ser Val Ser Tyr Thr Gly Ser Arg Ser Ala Ser Asn Met Ala He 
20 25 30 



<210> 20 

<211> 44 

<212> PRT 

<213> Homo sapiens 



<400> 20 

Lys Thr Cys Ser Pro Lys Gin Phe Ala Cys Arg Asp Gin He Thr Cys 

15 10 15 

He Ser Lys Gly Trp Arg Cys Asp Gly Glu Arg Asp Cys Pro Asp Gly 

20 25 30 

Ser Asp Glu Ala Pro Glu He Cys Pro Gin Ser Lys 
35 40 



<210> 21 
<211> 86 
<212> PRT 

<213> Homo sapiens 



<400> 21 



30 



Lys 


Thr 


Cys 


Ser 


Pro 


Lys 


Gin 


Phe 


Ala 


Cys 


Arg 


Asp 


Gin 


He Thr 


Cys 


1 








5 










10 








15 




He 


Ser 


Lys 


Gly 


Trp 


Arg 


Cys 


Asp 


Gly 


Glu 


Arg 


Asp 


Cys 


Pro Asp 


Gly 








20 










25 










30 




Ser 


Asp 


Glu 


Ala 


Pro 


Glu 


He 


Cys 


Pro 


Gin 


Ser 


Lys 


Ala 


Gin Arg 


Cys 






35 










40 










45 






Gin 


Pro 


Asn 


Glu 


His 


Asn 


Cys 


Leu 


Gly 


Thr 


Glu 


Leu 


Cys 


Val Pro 


Met 




50 










55 










60 








Ser 


Arg 


Leu 


Cys 


Asn 


Gly 


Val 


Gin 


Asp 


Cys 


Met 


Asp 


Gly Ser Asp 


Glu 


65 










70 










75 








80 


Gly 


Pro 


His 


Cys 


Arg 
85 


Glu 





















<210> 22 
<211> 43 
<212> PRT 

<213> Homo sapiens 
<400> 22 

Lys Ala Gin Arg Cys Gin Pro Asn Glu His Asn Cys Leu Gly Thr Glu 

15 10 15 

Leu Cys Val Pro Met Ser Arg Leu Cys Asn Gly Val Gin Asp Cys Met 

20 25 30 

Asp Gly Ser Asp Glu Gly Pro His Cys Arg Glu 
35 40 

<210> 23 

<211> 42 

<212> PRT 

<213> Homo sapiens 

<400> 23 

Gin Cys Gin Pro Gly Glu Phe Ala Cys Ala Asn Ser Arg Cys lie Gin 

15 10 15 

Glu Arg Trp Lys Cys Asp Gly Asp Asn Asp Cys Leu Asp Asn Ser Asp 

20 25 30 

Glu Ala Pro Ala Leu Cys His Gin His Thr 
35 40 

<210> 24 
<211> 82 
<212> PRT 

<213> Homo sapiens 
<400> 24 

Gin Cys Gin Pro Gly Glu Phe Ala Cys Ala Asn Ser Arg Cys He Gin 

15 10 15 

Glu Arg Trp Lys Cys Asp Gly Asp Asn Asp Cys Leu Asp Asn Ser Asp 

20 25 30 

Glu Ala Pro Ala Leu Cys His Gin His Thr Cys Pro Ser Asp Arg Phe 

35 40 45 

Lys Cys Glu Asn Asn Arg Cys He Pro Asn Arg Trp Leu Cys Asp Gly 

50 55 60 

Asp Asn Asp Cys Gly Asn Ser Glu Asp Glu Ser Asn Ala Thr Cys Ser 
65 70 75 80 

Ala Arg 



31 



<210> 25 
<211> 122 
<212> PRT 

<213> Homo sapiens 
<400> 25 



Gin 


Cys 


Gin 


Pro 


Gly 


Glu 


Phe 


Ala 


Cys 


Ala 


Asn 


Ser 


Arg 


Cys 


He 


Gin 


1 








5 










10 










15 




Glu 


Arg 


Trp 


Lys 


Cys 


Asp 


Gly 


Asp 


Asn 


Asp 


Cys 


Leu 


Asp 


Asn 


Ser 


Asp 








20 










25 










30 






Glu 










Cys 










Cys 






Asp 


Arg 


Phe 






35 










40 










45 








Lys 


Cys 


Glu 


Asn 


Asn 


Arg 


Cys 


He 


Pro 


Asn 


Arg 


Trp 


Leu 


Cys 


Asp 


Gly 




50 










55 










60 










Asp 


Asn 


Asp 


Cys 


Gly 


Asn 


Ser 


Glu 


Asp 


Glu 


Ser 


Asn 


Ala 


Thr 


Cys 


Ser 












70 






















Ala 


Arg 


Thr 


Cys 


Pro 


Pro 


Asn 


Gin 


Phe 


Ser 


Cys 


Ala 


Ser 


Gly Arg 


Cys 










85 










90 










95 




lie 


Pro 


He 


Ser 


Trp 


Thr 


Cys 


Asp 


Leu 


Asp 


Asp 


Asp 


Cys 


Gly 


Asp 


Arg 








100 










105 










110 






Ser 


Asp 


Glu 


Ser 


Ala 


Ser 


Cys 


Ala 


Tyr 


Pro 


















115 










12 0 




















<210> 


26 




























<211> 


161 




























<212> 


PRT 




























<213> 


Homo sapiens 






















<400> 


26 


























Gin 


Cys 


Gin 


Pro 


Gly 


Glu 


Phe 


Ala 


Cys 


Ala 


Asn 


Ser 


Arg 


Cys 


He 


Gin 


1 








5 










10 










15 




Glu 


Arg 


Trp 


Lys 


Cys 


Asp 


Gly Asp 


Asn 


Asp 


Cys 


Leu 


Asp 


Asn 


Ser 


Asp 








20 










25 










30 






Glu 


Ala 


Pro 


Ala 


Leu 


Cys 


His 


Gin 


His 


Thr 


Cys 


Pro 


Ser 


Asp 


Arg 


Phe 






35 










40 










45 








Lys 


Cys 


Glu 


Asn 


Asn 


Arg 


Cys 


He 


Pro 


Asn 


Arg 


Trp 


Leu 


Cys 


Asp 


Gly 




50 










55 










60 










Asp 


Asn 


Asp 


Cys 


Gly 


Asn 


Ser 


Glu 


Asp 


Glu 


Ser 


Asn 


Ala 


Thr 


Cys 


Ser 


65 










70 










75 










80 


Ala 


Arg 


Thr 


Cys 


Pro 


Pro 


Asn 


Gin 


Phe 


Ser 


Cys 


Ala 


Ser 


Gly 


Arg 


Cys 










85 










90 










95 




He 


Pro 


He 


Ser 


Trp 


Thr 


Cys 


Asp 


Leu 


Asp 


Asp 


Asp 


Cys 


Gly Asp 


Arg 








100 










105 










110 






Ser 


Asp 


Glu 


Ser 


Ala 


Ser 


Cys 


Ala 


Tyr 


Pro 


Thr 


Cys 


Phe 


Pro 


Leu 


Thr 






115 










120 










125 








Gin 


Phe 


Thr 


Cys 


Asn 


Asn 


Gly 


Arg 


Cys 


He 


Asn 


He 


Asn 


Trp 


Arg 


Cys 




130 










135 










140 










Asp 


Asn 


Asp 


Asn 


Asp 


Cys 


Gly 


Asp 


Asn 


Ser 


Asp 


Glu 


Ala 


Gly 


Cys 


Ser 


145 










150 










155 










160 



His 



<210> 27 
<211> 208 
<212> PRT 

<213> Homo sapiens 
<400> 27 



32 



Gin 


Cys 


Gin 


Pro 


Gly 


Glu 


Phe 


Ala 


Cys 


Ala 


Asn 


Ser 


Arg 


Cys 


He 


Gin 


1 








5 










10 










15 




Glu 


Arg 


Trp 


Lys 


Cys 


Asp 


Gly 


Asp 


Asn 


Asp 


Cys 


Leu 


Asp 


Asn 


Ser 


Asp 








20 










25 










30 






Glu 


Ala 


Pro 


Ala 


Leu 


Cys 


His 


Gin 


His 


Thr 


Cys 


Pro 


Ser 


Asp 


Arg 


Phe 






35 










40 










45 








Lys 


Cys 


Glu 


Asn 


Asn 


Arg 


Cys 


He 


Pro 


Asn 


Arg 


Trp 


Leu 


Cys 


Asp 


Gly 




50 










55 










60 










Asp 


Asn 


Asp 


Cys 


Gly 


Asn 


Ser 


Glu 


Asp 


Glu 


Ser 


Asn 


Ala 


Thr 


Cys 


Ser 


65 










70 










75 










80 


Ala 


Arg 


Thr 


Cys 


Pro 


Pro 


Asn 


Gin 


Phe 


Ser 


Cys 


Ala 


Ser 


Gly 


Arg 


Cys 










85 










90 










95 




He 


Pro 


He 


Ser 


Trp 


Thr 


Cys 


Asp 


Leu 


Asp 


Asp 


Asp 


Cys 


Gly 


Asp 


Arg 








100 










105 










110 






Ser 


Asp 


Glu 


Ser 


Ala 


Ser 


Cys 


Ala 


Tyr 


Pro 


Thr 


Cys 


Phe 


Pro 


Leu 


Thr 






115 










120 










125 








Gin 


Phe 


Thr 


Cys 


Asn 


Asn 


Gly 


Arg 


Cys 


He 


Asn 


He 


Asn 


Trp 


Arg 


Cys 




130 










135 










140 










Asp 


Asn 


Asp 


Asn 


Asp 


Cys 


Gly 


Asp 


Asn 


Ser 


Asp 


Glu 


Ala 


Gly 


Cys 


Ser 


14 5 










150 










155 










160 
















Phe 


Lys 








Gly 


Arg 


Cys 












165 










170 










175 




Pro 


Glu 


His 


Trp 


Thr 


Cys 


Asp 


Gly Asp 


Asn 


Asp 


Cys 


Gly 


Asp 


Tyr 


Ser 








180 










185 










190 






Asp 


Glu 


Thr 


His 


Ala 


Asn 


Cys 


Thr 


Asn 


Gin 


Ala 


Thr 


Arg 


Pro 


Pro 


Gly 






195 










200 










205 










<210> 


28 




























<211> 


150 




























<212> 


PRT 




























<213> 


Homo sapiens 






















<400> 


28 


























Gin 


Cys 


Gin 


Pro 


Gly 


Glu 


Phe 


Ala 


Cys 


Ala 


Asn 


Ser 


Arg 


Cys 


He 


Gin 


1 








5 










10 










15 




Glu 


Arg 


Trp 


Lys 


Cys 


Asp 


Gly 


Asp 


Asn 


Asp 


Cys 


Leu 


Asp 


Asn 


Ser 


Asp 








20 










25 










30 






Glu 


Ala 


Pro 


Ala 


Leu 


Cys 


His 


Gin 


His 


Thr 


Cys 


Pro 


Ser 


Asp 


Arg 


Phe 






35 










40 










45 








Lys 


Cys 


Glu 


Asn 


Asn 


Arg 


Cys 


He 


Pro 


Asn 


Arg 


Trp 


Leu 


Cys 


Asp 


Gly 




50 










55 










60 










Asp 


Asn 


Asp 


Cys 


Gly 


Asn 


Ser 


Glu Asp Glu 


Ser 


Asn 


Ala 


Thr 


Cys 


Ser 


65 










70 










75 










80 


Ala 


Arg 


Thr 


Cys 


Pro 


Pro 


Asn 


Gin 


Phe 


Ser 


Cys 


Ala 


Ser 


Gly 


Arg 


Cys 










85 










90 










95 




He 


Pro 


He 


Ser 


Trp 


Thr 


Cys 


Asp 


Leu 


Asp 


Asp 


Asp 


Cys 


Gly 


Asp 


Arg 








100 










105 










110 






Ser 


Asp 


Glu 


Ser 


Ala 


Ser 


Cys 


Ala 


Tyr 


Pro 


Thr 


Cys 




Pro 


Leu 


Thr 






115 










120 










125 








Gin 


Phe 


Thr 


Cys 


Asn 


Asn 


Gly 


Arg 


Cys 


He 


Asn 


He 


Asn 


Trp 


Arg 


Cys 




130 










135 










140 










Asp 


Asn 


Asp 


Asn 


Asp 


Cys 






















14 5 










150 


























210> 


29 




























<211> 


231 




























<212> 


PRT 



























<213> Homo sapiens 



33 



<400> 29 
Gin Cys Gin Pro 
1 

Glu Arg Trp Lys 
20 

Glu Ala Pro Ala 
35 

Lys Cys Glu Asn 
50 

Asp Asn Asp Cys 
65 

Ala Arg Thr Cys 

lie Pro lie Ser 
100 

Ser Asp Glu Ser 
115 

Gin Phe Thr Cys 
130 

Asp Asn Asp Asn 
14 5 

His Ser Cys Ser 

Pro Glu His Trp 
180 

Asp Glu Thr His 
195 

Gly Cys His Thr 
210 

Pro Leu Arg Trp 
225 



Gly Glu Phe Ala 
5 

Cys Asp Gly Asp 

Leu Cys His Gin 
40 

Asn Arg Cys lie 
55 

Gly Asn Ser Glu 
70 

Pro Pro Asn Gin 
85 

Trp Thr Cys Asp 

Ala Ser Cys Ala 
120 

Asn Asn Gly Arg 
135 

Asp Cys Gly Asp 
150 

Ser Thr Gin Phe 
165 

Thr Cys Asp Gly 

Ala Asn Cys Thr 
200 

Asp Glu Phe Gin 
215 

Arg Cys Asp 
230 



Cys Ala Asn Ser 
10 

Asn Asp Cys Leu 
25 

His Thr Cys Pro 

Pro Asn Arg Trp 
60 

Asp Glu Ser Asn 
75 

Phe Ser Cys Ala 
90 

Leu Asp Asp Asp 
105 

Tyr Pro Thr Cys 

Cys lie Asn lie 
140 

Asn Ser Asp Glu 
155 

Lys Cys Asn Ser 
170 

Asp Asn Asp Cys 
185 

Asn Gin Ala Thr 

Cys Arg Leu Asp 
220 



Arg Cys lie Gin 
15 

Asp Asn Ser Asp 
30 

Ser Asp Arg Phe 
45 

Leu Cys Asp Gly 

Ala Thr Cys Ser 
80 

Ser Gly Arg Cys 
95 

Cys Gly Asp Arg 
110 

Phe Pro Leu Thr 
125 

Asn Trp Arg Cys 

Ala Gly Cys Ser 
160 

Gly Arg Cys lie 
175 

Gly Asp Tyr Ser 
190 

Arg Pro Pro Gly 
205 

Gly Leu Cys lie 



<210> 30 
<211> 40 
<212> PRT 

<213> Homo sapiens 



<400> 30 

Cys Pro Ser Asp Arg Phe Lys Cys Glu Asn Asn Arg Cys lie Pro Asn 

15 10 15 

Arg Trp Leu Cys Asp Gly Asp Asn Asp Cys Gly Asn Ser Glu Asp Glu 

20 25 30 

Ser Asn Ala Thr Cys Ser Ala Arg 
35 40 



<210> 31 

<211> 80 

<212> PRT 

<213> Homo sapiens 



<400> 31 
Cys Pro Ser Asp Arg Phe Lys Cys 

1 5 
Arg Trp Leu Cys Asp Gly Asp Asn 
20 

Ser Asn Ala Thr Cys Ser Ala Arg 
35 40 



Glu Asn Asn Arg Cys lie Pro Asn 

10 15 

Asp Cys Gly Asn Ser Glu Asp Glu 

25 30 

Thr Cys Pro Pro Asn Gin Phe Ser 
45 



34 



Cys 


Ala 


Ser 


Gly Arg 


Cys 


He 


Pro 


He 


Ser 


Trp 


Thr 


Cys 


Asp 


Leu 


Asp 




50 










55 










60 










Asp 


Asp 


Cys 


Gly Asp 


Arg 


Ser 


Asp 


Glu 


Ser 


Ala 


Ser 


Cys 


Ala 


Tyr 


Pro 


65 










70 
























<210> 


32 




























<211> 


119 




























<212> 


PRT 




























<213> 


Homo sapiens 






















<400> 


32 


























Cys 


Pro 


Ser 


Asp 


Arg 


Phe 


Lys 


Cys 


Glu 


Asn 


Asn 


Arg 


Cys 


lie 


Pro 


Asn 


1 








5 










10 










15 




Arg 


Trp 


Leu 


Cys 


Asp 


Gly 


Asp 


Asn 


Asp 


Cys 


Gly 


Asn 


Ser 


Glu 


Asp 


Glu 








20 










25 










30 






Ser 


Asn 


Ala 


Thr 


Cys 


Ser 


Ala 


Arg 


Thr 


Cys 


Pro 


Pro 


Asn 


Gin 


Phe 


Ser 






35 










40 










45 








Cys 


Ala 


Ser 


Gly 


Arg 


Cys 


He 


Pro 


He 


Ser 


Trp 


Thr 


Cys 


Asp 


Leu 


Asp 




50 










55 










60 










Asp 


Asp 


Cys 


Gly 


Asp 


Arg 


Ser 


Asp 


Glu 


Ser 


Ala 


Ser 


Cys 


Ala 


Tyr 


Pro 


65 










70 










75 










80 


Thr 


Cys 


Phe 


Pro 


Leu 


Thr 


Gin 












Gly Arg 


Cys 












85 










90 










95 




Asn 


He 


Asn 


Trp 


Arg 


Cys 


Asp 


Asn 


Asp 


Asn 


Asp 


Cys 


Gl 

Y 


Asp 


Asn 


Ser 








100 










105 










110 






Asp 


Glu 


Ala 


Gly 


Cys 


Ser 


His 
























115 






























<210> 


33 




























<211> 


166 




























<212> 


PRT 




























<213> 


Homo sapiens 






















<400> 


33 


























Cys 


Pro 


Ser 


Asp 


Arg 


Phe 


Lys 


Cys 


Glu 


Asn 


Asn 


Arg 


Cys 


He 


Pro 


Asn 


1 








5 










10 










15 




Arg 


Trp 


Leu 


Cys 


Asp 


Gly 


Asp 


Asn 


Asp 


Cys 


Gly 


Asn 


Ser 


Glu 


Asp 


Glu 








20 










25 










30 






Ser 


Asn 


Ala 


Thr 


Cys 


Ser 


Ala 


Arg 


Thr 


Cys 


Pro 


Pro 


Asn 


Gin 


Phe 


Ser 






35 










40 










45 








Cys 


Ala 


Ser 


Gly 


Arg 


Cys 


He 


Pro 


He 


Ser 


Trp 


Thr 


Cys 


Asp 


Leu 


Asp 




50 










55 










60 










Asp 


Asp 


Cys 


Gly 


Asp 


Arg 


Ser 


Asp 


Glu 


Ser 


Ala 


Ser 


Cys 


Ala 


Tyr 


Pro 


65 










70 










75 










80 


Thr 


Cys 


Phe 


Pro 


Leu 


Thr 


Gin 


Phe 


Thr 


Cys 


Asn 


Asn 


Gly 


Arg 


Cys 


He 










85 










90 










95 




Asn 


He 


Asn 


Trp 


Arg 


Cys 


Asp 


Asn 


Asp 


Asn 


Asp 


Cys 


Gly 


Asp 


Asn 


Ser 








100 










105 










110 






Asp 


Glu 


Ala 


Gly 


Cys 


Ser 


His 


Ser 


Cys 


Ser 


Ser 


Thr 


Gin 


Phe 


Lys 


Cys 






115 










120 










125 








Asn 


Ser 


Gly 


Arg 


Cys 


He 


Pro 


Glu 


His 


Trp 


Thr 


Cys 


Asp 


Gly 


Asp 


Asn 




130 










135 










140 










Asp 


Cys 


Gly 


Asp 


Tyr 


Ser 


Asp 


Glu 


Thr 


His 


Ala 


Asn 


Cys 


Thr 


Asn 


Gin 


145 










150 










155 










160 


Ala 


Thr 


Arg 


Pro 


Pro 


Gly 






























165 

























<210> 34 



35 



<211> 108 
<212> PRT 

<213> Homo sapiens 



Cys 


Pro 


Ser 


Asp 


Arg 


Phe 


Lys 


Cys 


Glu 


Asn 


Asn 


Arg 


Cys 


He 


Pro 


Asn 


1 








5 










10 










15 




Arg 


Trp 


Leu 


Cys 


Asp 


Gly 


Asp 


Asn 


Asp 


Cys 


Gly Asn 


Ser 


Glu 


Asp 


Glu 








20 










25 










30 






Ser 


Asn 


Ala 
35 


Thr 


Cys 


Ser 


Ala 


Arg 
40 


Thr 


Cys 


Pro 


Pro 


Asn 
45 


Gin 


Phe 


Ser 


Cys 


Ala 
50 


Ser 


Gly 


Arg 


Cys 


He 
55 


Pro 


He 


Ser 


Trp 


Thr 
60 


Cys 


Asp 


Leu 


Asp 


Asp 


Asp 


Cys 


Gly 


Asp 


Arg 


Ser 


Asp 


Glu 


Ser 


Ala 


Ser 


Cys 


Ala 


Tyr 


Pro 


65 










70 










75 










80 


Thr 


Cys 


Phe 


Pro 


Leu 


Thr 


Gin 


Phe 


Thr 


Cys 


Asn 


Asn 


Gly Arg 


Cys 


He 










85 










90 










95 




Asn 


lie 


Asn 


Trp 
100 


Arg 


Cys 


Asp 


Asn 


Asp 
105 


Asn 


Asp 


Cys 











<210> 35 

<211> 289 

<212> PRT 

<213> Homo sapiens 



<400> 35 
Cys Pro Ser Asp 
1 

Arg Trp Leu Cys 
20 

Ser Asn Ala Thr 
35 

Cys Ala Ser Gly 
50 

Asp Asp Cys Gly 
65 

Thr Cys Phe Pro 

Asn He Asn Trp 
100 

Asp Glu Ala Gly 
115 

Asn Ser Gly Arg 
130 

Asp Cys Gly Asp 
145 

Ala Thr Arg Pro 

Leu Asp Gly Leu 
180 

Asp Cys Met Asp 
195 

Val Cys Asp Pro 
210 

lie Ser Lys Ala 
225 

Ser Asp Glu Glu 



Arg Phe Lys Cys 
5 

Asp Gly Asp Asn 

Cys Ser Ala Arg 
40 

Arg Cys He Pro 
55 

Asp Arg Ser Asp 
70 

Leu Thr Gin Phe 
85 

Arg Cys Asp Asn 

Cys Ser His Ser 
120 

Cys He Pro Glu 
135 

Tyr Ser Asp Glu 
150 

Pro Gly Gly Cys 
165 

Cys He Pro Leu 

Ser Ser Asp Glu 
200 

Ser Val Lys Phe 
215 

Trp Val Cys Asp 
230 

Asn Cys Glu Ser 
245 



Glu Asn Asn Arg 
10 

Asp Cys Gly Asn 
25 

Thr Cys Pro Pro 

He Ser Trp Thr 
60 

Glu Ser Ala Ser 
75 

Thr Cys Asn Asn 
90 

Asp Asn Asp Cys 
105 

Cys Ser Ser Thr 

His Trp Thr Cys 
140 

Thr His Ala Asn 
155 

His Thr Asp Glu 
170 

Arg Trp Arg Cys 
185 

Lys Ser Cys Glu 

Gly Cys Lys Asp 
220 

Gly Asp Asn Asp 
235 

Leu Ala Cys Arg 
250 



Cys He Pro Asn 
15 

Ser Glu Asp Glu 
30 

Asn Gin Phe Ser 
45 

Cys Asp Leu Asp 

Cys Ala Tyr Pro 
80 

Gly Arg Cys He 
95 

Gly Asp Asn Ser 
110 

Gin Phe Lys Cys 
125 

Asp Gly Asp Asn 

Cys Thr Asn Gin 
160 

Phe Gin Cys Arg 
175 

Asp Gly Asp Thr 
190 

Gly Val Thr His 
205 

Ser Ala Arg Cys 

Cys Glu Asp Asn 
240 

Pro Pro Ser His 
255 



36 



Pro Cys Ala Asn Asn Thr Ser Val Cys Leu Pro Pro Asp Lys Leu Cys 

260 265 270 

Asp Gly Asn Asp Asp Cys Gly Asp Gly Ser Asp Glu Gly Glu Leu Cys 
275 280 285 

Asp 



<210> 36 

<211> 40 

<212> PRT 

<213> Homo sapiens 



<400> 36 
Thr Cys Pro Pro Asn Gin Phe Ser 

1 5 
lie Ser Trp Thr Cys Asp Leu Asp 
20 

Glu Ser Ala Ser Cys Ala Tyr Pro 
35 40 



Cys Ala Ser Gly Arg Cys lie Pro 

10 15 

Asp Asp Cys Gly Asp Arg Ser Asp 
25 30 



<210> 37 

<211> 79 

<212> PRT 

<213> Homo sapie 



Thr 


Cys 


Pro 


Pro 


Asn 


Gin 


Phe 


Ser 


Cys 


Ala 


Ser 


Gly Arg Cys 


He 


Pro 


1 








5 










10 






15 




He 


Ser 


Trp 


Thr 


Cys 


Asp 


Leu 


Asp 


Asp 


Asp 


Cys 


Gly Asp Arg 


Ser 


Asp 








20 










25 






30 






Glu 


Ser 


Ala 


Ser 


Cys 


Ala 


Tyr 


Pro 


Thr 


Cys 


Phe 


Pro Leu Thr 


Gin 


Phe 






35 










40 








45 






Thr 


Cys 


Asn 


Asn 


Gly 


Arg 


Cys 


He 


Asn 


He 


Asn 


Trp Arg Cys 


Asp 


Asn 




50 










55 










60 






Asp 


Asn 


Asp 


Cys 


Gly 


Asp 


Asn 


Ser 


Asp 


Glu 


Ala 


Gly Cys Ser 


His 




65 










70 










75 









<210> 38 
<211> 126 
<212> PRT 

<213> Homo sapiens 



<400> 38 



Thr 


Cys 


Pro 


Pro 


Asn 


Gin 


Phe 


Ser 


Cys 


Ala 


Ser Gly Arg 


Cys 


He 


Pro 


1 








5 










10 






15 




He 


Ser 


Trp 


Thr 


Cys 


Asp 


Leu 


Asp 


Asp 


Asp 


Cys Gly Asp 


Arg 


Ser 


Asp 








20 










25 






30 






Glu 


Ser 


Ala 


Ser 


Cys 


Ala 


Tyr 


Pro 


Thr 


Cys 


Phe Pro Leu 


Thr 


Gin 


Phe 






35 










40 






45 








Thr 


Cys 


Asn 


Asn 


Gly 


Arg 


Cys 


He 


Asn 


He 


Asn Trp Arg 


Cys 


Asp 


Asn 




50 










55 








60 








Asp 


Asn 


Asp 


Cys 


Gly 


Asp 


Asn 


Ser 


Asp 


Glu 


Ala Gly Cys 


Ser 


His 


Ser 


65 










70 










75 






80 


Cys 


Ser 


Ser 


Thr 


Gin 


Phe 


Lys 


Cys 


Asn 


Ser 


Gly Arg Cys 


He 


Pro 


Glu 










85 










90 






95 




His 


Trp 


Thr 


Cys 


Asp 


Gly Asp 


Asn 


Asp 


Cys 


Gly Asp Tyr 


Ser 


Asp 


Glu 








100 










105 






110 







37 



Thr His Ala Asn Cys Thr Asn Gin Ala Thr Arg Pro Pro Gly 
115 120 125 



<210> 39 
<211> 68 
<212> PRT 

<213> Homo sapiens 



<400> 39 

Thr Cys Pro Pro Asn Gin Phe Ser Cys Ala Ser Gly Arg Cys lie Pro 

15 10 15 

lie Ser Trp Thr Cys Asp Leu Asp Asp Asp Cys Gly Asp Arg Ser Asp 

20 25 30 

Glu Ser Ala Ser Cys Ala Tyr Pro Thr Cys Phe Pro Leu Thr Gin Phe 

35 40 45 

Thr Cys Asn Asn Gly Arg Cys lie Asn He Asn Trp Arg Cys Asp Asn 

50 55 60 

Asp Asn Asp Cys 
65 



<210> 40 
<211> 248 
<212> PRT 

<213> Homo sapiens 



<400> 40 
Cys Pro Pro Asn 
1 

Ser Trp Thr Cys 
20 

Ser Ala Ser Cys 
35 

Cys Asn Asn Gly 
50 

Asn Asp Cys Gly 
65 

Ser Ser Thr Gin 

Trp Thr Cys Asp 
100 

His Ala Asn Cys 
115 

Thr Asp Glu Phe 
130 

Trp Arg Cys Asp 
145 

Ser Cys Glu Gly 

Cys Lys Asp Ser 
180 

Asp Asn Asp Cys 
195 

Ala Cys Arg Pro 
210 

Leu Pro Pro Asp 
225 

Ser Asp Glu Gly 



Gin Phe Ser Cys 
5 

Asp Leu Asp Asp 

Ala Tyr Pro Thr 
40 

Arg Cys He Asn 
55 

Asp Asn Ser Asp 
70 

Phe Lys Cys Asn 
85 

Gly Asp Asn Asp 

Thr Asn Gin Ala 
120 

Gin Cys Arg Leu 
135 

Gly Asp Thr Asp 
150 

Val Thr His Val 
165 

Ala Arg Cys He 

Glu Asp Asn Ser 
200 

Pro Ser His Pro 
215 

Lys Leu Cys Asp 
230 

Glu Leu Cys Asp 
245 



Ala Ser Gly Arg 
10 

Asp Cys Gly Asp 
25 

Cys Phe Pro Leu 

He Asn Trp Arg 
60 

Glu Ala Gly Cys 
75 

Ser Gly Arg Cys 
90 

Cys Gly Asp Tyr 
105 

Thr Arg Pro Pro 

Asp Gly Leu Cys 
14 0 

Cys Met Asp Ser 
155 

Cys Asp Pro Ser 
170 

Ser Lys Ala Trp 
185 

Asp Glu Glu Asn 

Cys Ala Asn Asn 
220 

Gly Asn Asp Asp 
235 



Cys He Pro He 
15 

Arg Ser Asp Glu 
30 

Thr Gin Phe Thr 
45 

Cys Asp Asn Asp 

Ser His Ser Cys 
80 

He Pro Glu His 
95 

Ser Asp Glu Thr 
110 

Gly Gly Cys His 
125 

He Pro Leu Arg 

Ser Asp Glu Lys 
160 

Val Lys Phe Gly 
175 

Val Cys Asp Gly 
190 

Cys Glu Ser Leu 
205 

Thr Ser Val Cys 

Cys Gly Asp Gly 
240 



38 



<210> 41 

<211> 39 

<212> PRT 

<213> Homo sapi' 



<400> 41 

Thr Cys Phe Pro Leu Thr Gin Phe 

1 5 
Asn lie Asn Trp Arg Cys Asp Asn 
20 

Asp Glu Ala Gly Cys Ser His 
35 



Thr Cys Asn Asn Gly Arg Cys lie 

10 15 
Asp Asn Asp Cys Gly Asp Asn Ser 
25 30 



<210> 42 
<211> 86 
<212> PRT 

<213> Homo sapiens 



<400> 42 



Thr 


Cys 


Phe Pro 


Leu 


Thr 


Gin 


Phe 


Thr 


Cys 


Asn 


Asn 


Gly Arg 


Cys 


He 


1 




5 










10 










15 




Asn 


lie 


Asn Trp 
20 


Arg 


Cys 


Asp 


Asn 


Asp 
25 


Asn 


Asp 


Cys 


Gly 


Asp 
30 


Asn 


Ser 


Asp 


Glu 


Ala Gly 


Cys 


Ser 


His 


Ser 


Cys 


Ser 


Ser 


Thr 


Gin 


Phe 


Lys 


Cys 




35 








40 










45 








Asn Ser Gly Arg 


Cys 


He 


Pro 


Glu 


His 


Trp 


Thr 


Cys 


Asp 


Gly Asp 


Asn 




50 








55 










60 










Asp 


Cys 


Gly Asp 


Tyr 


Ser 


Asp 


Glu 


Thr 


His 


Ala 


Asn 


Cys 


Thr 


Asn 


Gin 


65 








70 










75 










80 


Ala 


Thr 


Arg Pro 


Pro 
85 


Gly 























<210> 43 
<211> 169 
<212> PRT 

<213> Homo sapiens 



<400> 43 

Thr Cys Phe Pro Leu Thr Gin Phe Thr Cys Asn Asn Gly Arg Cys He 

15 10 15 

Asn He Asn Trp Arg Cys Asp Asn Asp Asn Asp Cys Gly Asp Asn Ser 

20 25 30 

Asp Glu Ala Gly Cys Ser His Ser Cys Ser Ser Thr Gin Phe Lys Cys 

35 40 45 

Asn Ser Gly Arg Cys He Pro Glu His Trp Thr Cys Asp Gly Asp Asn 

50 55 60 

Asp Cys Gly Asp Tyr Ser Asp Glu Thr His Ala Asn Cys Thr Asn Gin 
65 70 75 80 

Ala Thr Arg Pro Pro Gly Gly Cys His Thr Asp Glu Phe Gin Cys Arg 

85 90 95 

Leu Asp Gly Leu Cys He Pro Leu Arg Trp Arg Cys Asp Gly Asp Thr 

100 105 HO 

Asp Cys Met Asp Ser Ser Asp Glu Lys Ser Cys Glu Gly Val Thr His 

115 120 125 

Val Cys Asp Pro Ser Val Lys Phe Gly Cys Lys Asp Ser Ala Arg Cys 

130 135 140 

He Ser Lys Ala Trp Val Cys Asp Gly Asp Asn Asp Cys Glu Asp Asn 
145 150 155 160 



39 



Ser Asp Glu Glu Asn Cys Glu Ser Leu 
165 



<210> 44 
<211> 209 
<212> PRT 
<213> Homo sapie 



<400> 44 
Thr Cys Phe Pro 
1 

Asn lie Asn Trp 
20 

Asp Glu Ala Gly 
35 

Asn Ser Gly Arg 
50 

Asp Cys Gly Asp 
65 

Ala Thr Arg Pro 

Leu Asp Gly Leu 
100 

Asp Cys Met Asp 

115 

Val Cys Asp Pro 
130 

lie Ser Lys Ala 
145 

Ser Asp Glu Glu 

Pro Cys Ala Asn 
180 

Asp Gly Asn Asp 
195 

Asp 



Leu Thr Gin Phe 
5 

Arg Cys Asp Asn 

Cys Ser His Ser 
40 

Cys lie Pro Glu 
55 

Tyr Ser Asp Glu 
70 

Pro Gly Gly Cys 
85 

Cys lie Pro Leu 

Ser Ser Asp Glu 
120 

Ser Val Lys Phe 
135 

Trp Val Cys Asp 
150 

Asn Cys Glu Ser 
165 

Asn Thr Ser Val 

Asp Cys Gly Asp 
200 



Thr Cys Asn Asn 
10 

Asp Asn Asp Cys 
25 

Cys Ser Ser Thr 

His Trp Thr Cys 
60 

Thr His Ala Asn 
75 

His Thr Asp Glu 
90 

Arg Trp Arg Cys 
105 

Lys Ser Cys Glu 

Gly Cys Lys Asp 
140 

Gly Asp Asn Asp 
155 

Leu Ala Cys Arg 
170 

Cys Leu Pro Pro 
185 

Gly Ser Asp Glu 



Gly Arg Cys lie 
15 

Gly Asp Asn Ser 
30 

Gin Phe Lys Cys 
45 

Asp Gly Asp Asn 

Cys Thr Asn Gin 
80 

Phe Gin Cys Arg 
95 

Asp Gly Asp Thr 
110 

Gly Val Thr His 
125 

Ser Ala Arg Cys 

Cys Glu Asp Asn 
160 

Pro Pro Ser His 
175 

Asp Lys Leu Cys 
190 

Gly Glu Leu Cys 
205 



<210> 45 
<211> 47 
<212> PRT 

<213> Homo sapiens 
<400> 45 

Ser Cys Ser Ser Thr Gin Phe Lys Cys Asn Ser Gly Arg Cys lie Pro 

15 10 15 

Glu His Trp Thr Cys Asp Gly Asp Asn Asp Cys Gly Asp Tyr Ser Asp 

20 25 30 

Glu Thr His Ala Asn Cys Thr Asn Gin Ala Thr Arg Pro Pro Gly 
35 40 45 

<210> 46 
<211> 89 
<212> PRT 

<213> Homo sapiens 
<400> 46 



40 



Ser 


Cys 


Ser 


Ser 


Thr 


Gin 


Phe 


Lys 


Cys 


Asn 


Ser 


Gly Arg 


Cys 


lie 


Pro 


1 






5 










10 








15 




Glu 


His 


Trp 


Thr 


Cys 


Asp 


Gly Asp 


Asn 


Asp 


Cys 


Gly Asp 


Tyr 


Ser 


Asp 






20 










25 








30 






Glu 


Thr 


His 


Ala 


Asn 


Cys 


Thr 


Asn 


Gin 


Ala 


Thr 


Arg Pro 


Pro 


Gly 


Gly 






35 










40 








45 








Cys 


His 


Thr 


Asp 


Glu 


Phe 


Gin 


Cys 


Arg 


Leu 


Asp 


Gly Leu 


Cys 


lie 


Pro 


50 








55 










60 








Leu 


Arg 


Trp 


Arg 


Cys 


Asp 


Gly 


Asp 


Thr 


Asp 


Cys 


Met Asp 


Ser 


Ser 


Asp 


65 










70 










75 








80 


Glu 


Lys 


Ser 


Cys 


Glu 
85 


Gly 


Val 


Thr 


His 















<210> 47 
<211> 170 
<212> PRT 

<213> Homo sapiens 



<400> 47 

Ser Cys Ser Ser Thr Gin Phe Lys Cys Asn Ser Gly Arg Cys He Pro 

15 10 15 

Glu His Trp Thr Cys Asp Gly Asp Asn Asp Cys Gly Asp Tyr Ser Asp 

20 25 30 

Glu Thr His Ala Asn Cys Thr Asn Gin Ala Thr Arg Pro Pro Gly Gly 

35 40 45 

Cys His Thr Asp Glu Phe Gin Cys Arg Leu Asp Gly Leu Cys He Pro 

50 55 60 

Leu Arg Trp Arg Cys Asp Gly Asp Thr Asp Cys Met Asp Ser Ser Asp 
65 70 75 80 

Glu Lys Ser Cys Glu Gly Val Thr His Val Cys Asp Pro Ser Val Lys 

85 90 95 

Phe Gly Cys Lys Asp Ser Ala Arg Cys He Ser Lys Ala Trp Val Cys 

100 105 HO 

Asp Gly Asp Asn Asp Cys Glu Asp Asn Ser Asp Glu Glu Asn Cys Glu 

115 120 125 

Ser Leu Ala Cys Arg Pro Pro Ser His Pro Cys Ala Asn Asn Thr Ser 

130 135 140 

Val Cys Leu Pro Pro Asp Lys Leu Cys Asp Gly Asn Asp Asp Cys Gly 
145 150 155 160 

Asp Gly Ser Asp Glu Gly Glu Leu Cys Asp 
165 170 



<210> 48 
<211> 42 
<212> PRT 

<213> Homo sapiens 



<400> 48 

Gly Cys His Thr Asp Glu Phe Gin Cys Arg Leu Asp Gly Leu Cys He 

15 10 15 

Pro Leu Arg Trp Arg Cys Asp Gly Asp Thr Asp Cys Met Asp Ser Ser 

20 25 30 

Asp Glu Lys Ser Cys Glu Gly Val Thr His 
35 40 



<210> 49 
<211> 83 
<212> PRT 



41 



<213> Homo sapiens 



<400> 49 
Gly Cys His Thr Asp Glu Phe Gin 

1 5 
Pro Leu Arg Trp Arg Cys Asp Gly 
20 

Asp Glu Lys Ser Cys Glu Gly Val 

35 40 
Lys Phe Gly Cys Lys Asp Ser Ala 

50 55 
Cys Asp Gly Asp Asn Asp Cys Glu 
65 70 
Glu Ser Leu 



Cys Arg Leu Asp Gly Leu Cys He 

10 15 
Asp Thr Asp Cys Met Asp Ser Ser 
25 30 
Thr His Val Cys Asp Pro Ser Val 
45 

Arg Cys He Ser Lys Ala Trp Val 
60 

Asp Asn Ser Asp Glu Glu Asn Cys 
75 80 



<210> 50 
<211> 123 
<212> PRT 

<213> Homo sapiens 



<400> 50 

Gly Cys His Thr Asp Glu Phe Gin 

1 5 
Pro Leu Arg Trp Arg Cys Asp Gly 
20 

Asp Glu Lys Ser Cys Glu Gly Val 

35 40 
Lys Phe Gly Cys Lys Asp Ser Ala 

50 55 
Cys Asp Gly Asp Asn Asp Cys Glu 
65 70 
Glu Ser Leu Ala Cys Arg Pro Pro 
85 

Ser Val Cys Leu Pro Pro Asp Lys 
100 

Gly Asp Gly Ser Asp Glu Gly Glu 
115 120 



Cys Arg Leu Asp Gly Leu Cys He 

10 15 
Asp Thr Asp Cys Met Asp Ser Ser 
25 30 
Thr His Val Cys Asp Pro Ser Val 
45 

Arg Cys He Ser Lys Ala Trp Val 
60 

Asp Asn Ser Asp Glu Glu Asn Cys 

75 80 
Ser His Pro Cys Ala Asn Asn Thr 

90 95 
Leu Cys Asp Gly Asn Asp Asp Cys 
105 HO 
Leu Cys Asp 



<210> 51 
<211> 41 
<212> PRT 

<213> Homo sapiens 



<400> 51 

Val Cys Asp Pro Ser Val Lys Phe Gly Cys Lys Asp Ser Ala Arg Cys 

1 5 10 15 

He Ser Lys Ala Trp Val Cys Asp Gly Asp Asn Asp Cys Glu Asp Asn 

20 25 30 

Ser Asp Glu Glu Asn Cys Glu Ser Leu 
35 40 



<210> 52 

<211> 81 

<212> PRT 

<213> Homo sapiens 



<400> 52 



42 



Val Cys Asp Pro Ser Val Lys Phe 

1 5 
He Ser Lys Ala Trp Val Cys Asp 
20 

Ser Asp Glu Glu Asn Cys Glu Ser 

35 40 
Pro Cys Ala Asn Asn Thr Ser Val 

50 55 
Asp Gly Asn Asp Asp Cys Gly Asp 
65 70 
Asp 



Gly Cys Lys Asp Ser Ala Arg Cys 

10 15 
Gly Asp Asn Asp Cys Glu Asp Asn 
25 30 
Leu Ala Cys Arg Pro Pro Ser His 
45 

Cys Leu Pro Pro Asp Lys Leu Cys 
60 

Gly Ser Asp Glu Gly Glu Leu Cys 
75 80 



<210> 53 
<211> 40 
<212> PRT 

<213> Homo sapiens 
<400> 53 

Ala Cys Arg Pro Pro Ser His Pro Cys Ala Asn Asn Thr Ser Val Cys 

10 15 



Leu 



Pro Pro Asp Lys Leu Cys Asp Gly Asn Asp Asp Cys Gly Asp Gly 



Ser Asp Glu Gly Glu Leu Cys Asp 
35 40 

<210> 54 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 54 

Ser Gly Phe Ser Leu Gly Ser Asp Gly Lys 



<210> 55 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 55 

Gly He Ala Leu Asp Pro Ala Met Gly Lys 



<210> 56 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 56 

Gly Gly Ala Leu His He Tyr His Gin Arg 



<210> 57 
<211> 11 
<212> PRT 

<213> Homo sapiens 



43 



r 



Val Phe Phe Thr Asp Tyr Gly Gin Tie Pro Lys 



44 



